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(54) Titte: COMPLEX COMBINATORIAL CHEMICAL LIBRARIES ENCODED WITH TAGS 
(57) Abstract 

Encoded combinatorial chemistry is provided, where sequential synthetic schemes are recorded using organic molecules, which 
define choice of reactant, and stage, as the same or different bit of information. Various products can be produced in the multi-stage 
synthesis, such as oligomers and synthetic non-repetitive organic molecules. Conveniently, nested families of compounds can be employed 
as identifiers, where number and/or position of a substituent define the choice. Alternatively, detectable functionalities may be employed, 
such as radioisotopes, fluorescers. halogens, and the like, where presence and ratios of two different groups can be used to define stage or 
choice. Particulariy, pluralities of identifiers may be used to provide a binary or higher code, so as to define a plurality of choices with 
only a few detachable tags. The particles may be screened for a characteristic of interest, particularly binding affinity, where the products 
may be detached from the particle or retained on the particle. The reaction history of the particles which are positive for the characteristic 
can be detennined by the release of the tags and analysis to define the reaction history of the particle. 
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COMPLEX COMBINATORIAL CHEMICAL LIBRARIES ENCODED WITH TAGS 

This application is a continuation-in-part of U.S. Serial 
No. 08/227,007, filed April 13, 1994 the contents of which 
are hereby incorporated by reference into the subject 
application. 

5 

Introduction 
Technical Field 

The field of this invention concerns combinatorial 
chemistry which involves syntheses having a plurality of 
10 stages, with each stage involving a plurality of choices, 
where large numbers of products having varying 
compositions are obtained. 

Background of the Invention 

15 There is substantial interest in devising facile methods 
for the synthesis of large numbers of diverse compounds 
which can then . be screened for various possible 
physiological or other activities. Typically such a 
synthesis involves successive stages, each of which 

20 involves a chemical modification of the then existing 
molecule. For example, the chemical modification may 
involve the addition of a unit, e.g. a monomer or synthon, 
to a growing sequence or modification of a functional 
group. By employing syntheses where the chemical 

25 modification involves the addition of \inits, such as amino 
acids, nucleotides, sugars, lipids, or heterocyclic 
compounds where the units may be naturally- occurring, 
synthetic, or combinations thereof, one may create a large 
number of compounds. Thus, even if one restricted the 

30 synthesis to naturally-occurring units or building blocks, 
the number of choices would be very large, 4 in the case 
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of nucleotides, 20 in the case of the common amino acids, 
and essentially an unlimited number in the case of sugars . 

One disadvantage heretofore inherent in the production of 
5 large numbers of diverse compounds, where at each stage of 
the synthesis there are a significant number of choices, 
is the fact that each individual compound will be present 
in a minute amount. While a characteristic of a 
particular compound, e.g. a physiological activity, may be 
10 determinable, it is usually impossible to identify the 
chemical structure of the particular compound present . 

Moreover, physiologically-active compounds have 
historically been discovered by assaying crude broths 

15 using Edisonian or stochastic techniques, where only a 
relatively few compounds are assayed at a time, or where 
a limited number of structurally similar homologs of 
naturally -occurring physiologically-active compounds are 
assayed. Two major problems have been associated with the 

20 use of such crude broths, namely, the necessity to purify 
the reaction mixture into individual component compounds 
and the time-consuming effort required to establish the 
structure of the compound once purified. 

25 To address these disadvantages and problems, techniques 
have been developed in which one adds individual units as 
part of a chemical synthesis sequentially, either in a 
controlled or a random manner, to produce all or a 
substantial proportion of the possible compounds which can 

30 result from the different choices possible at each 
sequential stage in the synthesis. However, for these 
techniques to be successful it is necessary for the 
compounds made by them to be amenable to methods which 
•will allow one to determine the composition of a 

35 particular compound so made which shows a characteristic 
of interest. 
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One such approach involves using a chip which allows for 
separate analysis at physically separate sites on the 
surface of the chip (Fodor et al., Science 251: 767 
[1991] ) . By knowing which reactant is added sequentially 
5 at each such site, one can record the sequence of events 
and thus the series of reactions. If one then subjects 
the chip to a screening method for a particular desired 
characteristic and detects the characteristic one can 
readily determine the compound synthesized at the site 
10 which demonstrates that characteristic. 

Another such technique involves the theoretical synthesis 
of oligonucleotides in parallel with the synthesis of 
oligopeptides as the compounds of interest (Brenner and 
15 Lemer, PNAS USA [1992] 81: 5381-5383) . 

Further techniques are also disclosed in the following 
publications: Amoto, Science (1992) 257/ 330-331, 
discusses the use of cosynthesized DNA labels to identify 

20 polypeptides. Lam, et al., i^ature (1991) 354/ 82-84, 
describe a method for malcing large peptide libraries. 
Houghton, et al., Nature (1991) 354, 84-86, and Jung and 
Beclc-Sic]cinger, Angew. Chem, Int. Ed. Engl. (1992) 91, 
367-383, describe methodology for making large peptide 

25 libraries. Kerr et al ■ . J. Amer. Chem. Soc, (1993) 115 , 
2529-31, teach a method of synthesizing oligomer libraries 
encoded by peptide chains . Finally, international 
applications WO 91/17823 and WO 92/09300 concern 
combinatorial libraries . 

30 

However, since methods such as the preceding typically 
require the addition of like moieties, there is 
substantial interest in discovering methods for producing 
compounds which are not limited to sequential addition of 
35 like moieties. Such methods would find application, for 
example, in the modification of steroids, antibiotics, 
sugars, coenzymes, enzyme inhibitors, ligands and the 
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like, which frequently involve a multi-stage synthesis in 
which one would wish to vary the reagents and/or 
conditions to provide a variety of compounds. In such 
methods the reagents may be organic or inorganic reagents, 
5 where functionalities may be introduced or modified, side 
groups attached or removed, rings opened or closed, 
stereochemistary changed, and the like. (See, for example, 
Bunin and Ellman, JACS 114, 10997 [1992].) For such a 
method to be viable, however, there needs to be a 

10 convenient way to identify the structures of the large 
number of compounds which result from a wide variety of 
different modifications. Thus, there is a need to find 
a way whereby the reaction history may be recorded, and 
desirably, the structures of the resultant compounds 

15 identified. 

Finally, as the size of a library of compounds so 
synthesized increases, known technicjues of structure 
elucidation and product segregation introduce substantial 

20 inefficiencies and uncertainties which hinder the accurate 
determination of the structure of any compound identified 
as being of interest. Thus, there is a substantial need 
for new methods which will permit the synthesis of complex 
combinatorial chemical libraries which readily permit 

25 accurate structural determination of individual compounds 
within the library which are identified as being of 
interest . 

Many of the disadvantages of the previously-described 
30 methods as well as many of the needs not met by them are 
addressed by the present invention which, as described 
more fully hereinafter, provides myriad advantages over 
these previously-described methods. 
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Sumtnarv of The Invention 

Methods and compositions are provided for encoded 
combinatorial libraries, whereby at each stage of the 
synthesis, a support, such as a particle, upon which a 
5 compound is being synthesized is uniquely tagged to define 
a particular event, usually chemical, associated with the 
synthesis of the compound on the support. The tagging is 
accomplished using identifier molecules which record the 
sequential events to which the supporting particle is 
10 exposed during synthesis, thus providing a reaction 
history for the compound produced on the support. 

Each identifier molecule is characterized by being stable 
under the synthetic conditions employed, by remaining 

15 associated with the supports during the synthesis, by 
uniquely defining a particular event during the synthesis 
which reflects a particular reaction choice at a given 
stage of the synthesis, by being distinguishable from 
other components that may be present during assaying, and 

20 by allowing for detachment of a tag component which is 
discernible by a convenient, analytical technique. 

The identifiers of this invention are used in combination 
with one another to form a cinary or higher order encoding 
25 system permitting a relatively small number of identifiers 
to be used to encode a relatively large number of reaction 
products. For example, when used in a binary code N 
identifiers can uniquely encode up to 2^ different 
compounds . 

30 

Moreover, the identifiers of this invention need not be 
bound serially through a previous identifier but rather 
are individually bound to the substrate, either directly 
or through the product being synthesized. The identifiers 
35 are not sequencable. Furthermore, the identifiers contain 
a cleavable member or moiety which permits detachment of 
a tag component which can be readily analyzed. 
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Conveniently, the combinatorial synthesis employs 
definable solid supports upon which reactions are 
performed and to which the identifiers are bound. The 
individual solid supports or sxabstrates carrying the final 
5 product compounds may be screened for a characteristic of 
interest and the reaction history determined by analyzing 
the associated identifier tags. 
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Brief Description of the Drawings 

Figure 1 illustrates the analysis of tag 4 by mass 
spectroscopy. Two signals corresponding to tag 4 were 
observed . 

5 

Figure 2 illustrates the analysis of tag 11 by mass 
spectroscopy. Two signals corresponding to tag 11 were 
observed , 

10 Figure 3 illustrates the analysis of tag 13 by mass 
spectroscopy. Two signals corresponding to tag 13 were 
observed. 

Figure 4 illustrates the analysis of tags 4, 11 and 13 by 
15 positive chemical ionization mass spectroscopy (PCIMS) 
when approximately ec[ual amounts of each tag were mixed 
together. Two signals corresponding to each separate tag 
could easily be distinguished, 

20 Figure 5 illustrates the mass spectrum and chromatogram 
for the underivativzed d-7 sample only. 

Figure 6 illustrates the mass spectra and chromatograms 
showing the improvement in chromatography due to 
25 derivatization of tags, a d-5 and d-7 mixed sample are 
assessed. 
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DETAILED DESCRIPTION OF THE INVENTION 

As used in this application the term "tag" or "T" means a 
chemical moiety which possesses two properties. First, it 
is capable of being distinguished from all other chemical 
5 moieties. Second, it is capable of being detected when 
present at 10"" to 10'^ mole. These two properties may be 
embodied in a single chemical structure. Alternatively, 
these properties may be embodied in separate chemical 
structures which are linked together. In this latter 

10 case, one of the chemical structures, which may be 
designated C (or in the case of more than one such 
structure C, C' , etc . ) provides the property of rendering 
the tag distinguishable from other tags while the. other 
chemical structure, E, provides the property of rendering 

15 the tag detectable and optionally may provide the property 
of rendering the tag separable from other tags. 

As used in this application, the term "linker" or "L" 
means a chemical moiety which possesses three properties. 

20 First, it is attachable to a solid support. Second, it is 
attachable to a tag. Third, when it is attached to both 
a solid support and a tag, it is cleavable such that the 
tag may be released from the solid support. These three 
properties may be embodied in a single chemical structure. 

25 Alternatively, these properties are embodied in three 
chemical structures which are linked together. In this 
latter case one of the chemical structures, which may be 
designated F^, provides the property of rendering the 
linker attachable to the solid support; the second 

30 chemical structure, which may be designated V, provides 
the property of rendering the linker cleavable; and the 
third chemical structure which may be designed A' , 
provides the property of rendering the linker attachable 
to the tag. Desirably, the chemical structures V and A' 

35 are one and the same, in which case V-A' may be designated 
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As used in this application, the term "identifier" means 
a chemical entity which includes both a tag and a linker. 
ThuB/ in the broadest sense an identifier may be 
represented by the formula L-T while specific embodiments 
5 of the identifier may be represented by the formulae F^-V- 
A' -T; F^-V-A' -C-E (or F^-V-A' -E-C) ; L-C-E (or L-E-C) ; and 
L-C-E-C' . 

As used in this application, the term "bound identifier" 
10 means an identifier attached to a solid support. 

As used herein, the term "choice" means the alternative 
variables for a given stage in a combinatorial synthesis, 
such as reactant, reagent, reaction conditions, and 
15 combinations thereof. The term "stage" corresponds to a 
step in the sequential synthesis of a compound or ligand; 
the compound or ligand being the final product of a 
combinatorial synthesis . 

20 The term "alkyl" includes linear, branched, and cyclic 
structures emd combinations thereof. Thus, the term 
includes methyl, ethyl, propyl, isopropyl, butyl, sec- and 
tert -butyl, cyclopropyl, cyclobutyl, cyclopentyl, 2- 
methylcyclopropyl , and the like. Lower alkyl is Ci-Cg 

25 alkyl- Lower alkenyl is Cj-Cg alkenyl of a linear, 
branched, or cyclic configuration and combinations 
thereof . 

Unless otherwise indicated, it is intended that the 
30 definitions of euxy substituent ( e.g. . R^, Z, etc.) in a 
particular molecule be independent of its definitions 
elsewhere in the molecule. Thus, NR*R^ represents NHH, 
NHCH3, NHCH2CH3, N (013)2, etc. 

35 Some of the compounds described herein contain one or more 
centers of asymmetry and may thus give rise to 
enantiomers, diastereoisomers, and other steroisomeric 
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forms. The present invention is meant to include all such 
possible stereoisomers as well as their racemic and 
optically pure forms. Optically active (R) and (S) 
isomers may be prepared using chiral synthons, chiral 
5 reagents, or resolved using conventional techniques. When 
the compounds described herein contain olefinic double 
bonds, it is intended to include both E and 2 geometric 
isomers . 

10 The materials upon which the combinatorial syntheses of 
this invention are performed are referred to herein 
interchangeably as beads, solid surfaces, (solid) 
substrates, particles, supports, etc. These terms are 
intended to include: 
15 a) solid supports such as beads, pellets, disks, 

capillaries, hollow fibers, needles, solid 
fibers, cellulose beads, pore-glass beads, 
silica gels, polystyrene beads optionally cross - 
linked with divinylbenzene , grafted co-poly 
20 beads, poly- aery lamide beads, latex beads, 

dimethylacrylamide beads optionally cross -linked 
with N,N' -bis-acryloyl ethylenediamine, glass 
particles coated with a hydrophobic polymer, 
etc., i.e., a material having a rigid or semi- 
25 rigid surface; and 

b) soluble supports such as low molecular weight 
non- cross -linked polystyrene. 

These materials must contain functionalities or must be 
30 able to be functionalized such that identifiers or product 
intermediates may be attached to them. 

In addition, the following abbreviations have the 
indicated meanings: 
35 AcOH = acetic acid 

BSA = bis(trimethylsilyl)acetamide 

CAN = cerium (iv) ammonium nitrate 
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DEAD 


= 


diethyl azodicarboxylate 


DCM 




dichloromethane 


Die 


= 


diisopropylcarbodiimide 


DMF 




N, N-dimethylf ormatnide 


Fmoc 




9 - f luorenylmethoxycarbonyl 


HOBt 




1 -hydroxybenzot riazole 


PhMe 




toluene 


r.t . 




room temperature 


TFA 




trif luoroacetic acid 


THF 




t e t r ahy dr o f ur an 



The subject invention concerns the production of libraries 
of products, i.e. compounds, where the individual products 
or compoimds present in the libraries may be physically 

15 separated from one another and may be screened for a 
characteristic of interest either bound to, or detached 
from, a solid support. By having serial syntheses, where 
at each stage of a synthesis each of the individual 
intermediates is treated in a variety of ways, a very 

20 large number of products is produced, each of which is 
present in a small amount, frequently less than 100 pmol, 
more frequently less than 10 nmol. Because of the small 
quantity of final product or compound so produced, 
identifying these products by isolating and structurally 

25 elucidating the products would generally not be feasible. 
Moreover, in sequential synthesis involving other than the 
addition of similar units, the analysis would be arduous 
if not impossible using the amount of product typically 
available. However, by associating each choice or 

30 combination of choices (e.g., "add reagent A" or "add 
reagent A, then reagent B, and heat to 100°C for 2 hrs.") 
of the serial synthesis with a combination of identifiers 
which define the choice of variables such as reactant, 
reagent, reaction conditions, or a combination of these, 

35 one can use the identifiers to define the reaction history 
of each definable and separable substrate. The analysis 
of tags detached from the identifiers allows for ready 
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identif ication of the reaction history, at picomolar or 
lower concentrations, e.gr. femtoraolar or less. One can 
determine a characteristic of a product of a synthesis, 
usually a chemical or biological characteristic by various 
5 screening techniques, and then identify the reaction 
history and thereby the structure of that product, which 
has the desired characteristic, by virtue of the tags 
associated with the product . 

10 The use of the instant nu?.t:iple tag system avoids the 
necessity of carrying out a complicated cosynthesis which 
reduces yields amd requires multiple protecting groups, 
and avoids the necessity of using sequencable tags which 
are necessarily chemically labile. Both the necessity of 

15 multiple protecting groups and the intrinsic instability 
of all known sequencable tagging molecules (i.e., nucleic 
acid or peptide oligomers) severely limit the chemistry 
which may be used in the synthesis of the library element 
or ligand. 

20 

Moreover, the use of a binary, or higher order, multiple 
tag system reduces enormously the number of tags necessary 
to encode the reagent/ react. -9 nt choice in any stage in a 
synthesis. For example, if a particular synthetic stage 

25 could be carried with 128 different choices for reagent, 
the binary system would require only 7 tags. This can 
make the difference between a practical encoding system 
and an impractical one, because it may not be feasible to 
obtain and use the large number of distinguishable tags 

30 required by other systems. With th^ binary system of the 
invention, 30 distinguishable tags are available and are 
sufficient to encode >10^ different syntheses . 

Importantly, the present method employs tags which are 
35 detachable from a ligand or solid support for the purpose 
of decoding. Such detachability also allows the tags to 
be distinguished on more than one basis; in particular, 
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they can be separated ( e.cr, . on the basis of 
chromatographic retention time) and then analyzed ( e.g. , 
a second basis is a spectral property such as mass 
spectroscopy m/e, or electrophoricity) . Having multiple 
^ 5 bases for distinction allows the encoding of large amounts 

of information with a small number of tags. 

Detachment further allows tags to be detected at very low 
levels, because they can be removed from the support 
10 matrix on which the synthesis is effected and from the 
ligand synthesized, the presence of either of which could 
provide spurious background signals, e.g. by quenching 
fluorescence or the like. 

15 Detachable tags are also amenable to rapid analysis by 
automated sampling systems, and allow for selective 
derivatization for detection via functional groups, 
eliminating any incompatibility between the detection 
moiety and the reaction conditions used in the synthesis. 

20 

Inherent in any tagging scheme is the requirement that the 
chemical characteristics of the tags and the chemical 
stages for their incorporation be compatible with the 
characteristics of the ligand and the stages in their 

25 synthesis, and vice versa. The advantage of tags that are 
generally unreactive, as exemplified hereinafter by the 
substituted- aryloxypolymethylene moieties, is a greater 
range of chemical transf onnations and chemical 
functionality that can be employed in synthesis of the 

30 ligands. 

A further advantage of the chemically stable tags of this 
invention is their compatibility with a greater variety of 
rapid, convenient methods of separation and analysis, such 
35 as gas chromatography and mass spectrometry. Moreover, 
the organic tags of these invention generally do not 
specifically interact with biological receptors. Thus, 
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these tags will generally not give spurious results in 
biological assays and will generally not be modified by 
enzymes or other biological molecules. 

5 Finally, the chemical stability of the present tags allows 
them to be detached by a wide variety of methods which 
improves sensitivity in their analysis as described above. 

Thus, this invention provides methods and compositions for 
10 encoded combinatorial synthesis whereby at each stage of 
the synthesis one or more identifiers are provided which 
encode an event associated with a particle stage in the 
synthesis of a compound on a support or particle. This, 
event comprises the choice of reactant and/or reaction 
15 conditions at that stage of the reactions where each such 
stage may involve one or more reactants which are the same 
or different under the same or different conditions, e.g. 
partial reactions, multiple additions, rate of addition, 
differing combinations of reagents, etc. In addition, 
20 groups of particles may be sequestered from other groups 
of particles and subjected to a different series of events 
at any time during the course of the sequential synthesis. 

By providing N identifiers, each having M distinguishable 
25 states, different syntheses can be uniquely defined. In 
the case of M=2 where the two states could be the presence 
or absence of identifier, the synthesis would thus be 
defined by a base 2 or binary code. In the case of M=3 
where the three states could be the presence of an 
30 identifier at two distinguishable concentrations or its 
absence, the synthesis would be defined by a base 3 code. 
Herein, such base M codes where M>2 are termed higher 
order codes. The advantage of higher order codes over a 
binary code is that fewer identifiers are required to 
35 encode the same quantity of information about the 
synthesis. The products which are produced will be 
defined as resulting from a serial synthesis. At each 
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stage in the synthesis, there is available a plurality of 
reactants and/or reagents and/or conditions, which result 
in a feature of the product in relation to an identifiable 
and usually separable entity, e.g. tag. In referring to 
5 reactants and reagents, it is intended that the reactant, 
for the most part, becomes incorporated into the product, 
e.g. an amino acid, nucleotide, nucleophile, electrophile, 
diene, alkylating or acylating agent, diamine, or any 
other synthon, etc. while a reagent may or may not become 

10 incorporated into the product, e.g. base, acid, heat, 
oxidizing or reducing agent, while both will be included 
under the term "agent". The synthesis may involve 
individual reactants which become incorporated into the 
product. Alternatively, a stage may involve one or more 

15 reactions which result in a modification of a reaction 
intermediate. In many cases, combinations of these 
possibilities will be involved. 

Using a base 2 or binary code (M=2) and three identifiers 
20 (N=3), as many as 8 (2^) agents for a given stage in a 
synthesis may be encoded If the three identifiers are 
represented as Tl, T2, and T3 and the presence or absence 
of each identifier is represented as a '0' or '1' 
respectively, then eight different agents could be 
25 represented in a binary code as follows: 





Agent 1 


Agent 2 


Agent 3 


Agent 4 


Tl , T2 , T3 


0,0,0 


1,0,0 


0,1,0 


1,1/0 




Agent 5 


Agent 6 


Agent 7 


Agent 8 


T1,T2,T3 


0,0,1 


1,0,1 


0,1,1 


1,1,1 



30 

Similarly, even more information about the synthesis may 
be encoded by more identifiers. For example, 9 
identifiers (N=3) and a base 2 code (M=2) would allow up 
to 2' or 512 different agent choices to be encoded. Using 
35 a base 3 code (M=3) and three identifiers (N=3) would 
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allow as many as 27 (3^) agent choices to be encoded. If 
the three identifiers are represented as Tl, T2 and T3, 
and the absence of an identifier is represented as a '0', 
its presence at a quantity of -0.5 pmol/bead as a '1', and 
5 its presence at a quantity at -1.0 pmol/bead as a '2', 
then the 27 different agents could be represented by three 
identifiers in base 3 code as: 







Agent 1 


Agent 2 


Agent 3 


Agent 4 


10 


T1,T2,T3 


0,0,0 


1,0,0 


2,0,0 


0,1,0 






Agent 5 


Agent 6 




Agent 27 




Tl , T2 , T3 


1,1,0 


2,1,0 




2,2,2 



To make such higher order encoding schemes practical, one 
15 additional identifier at a given quantity (e.g., -1.0 
pmol/bead) would be added to all members of the library to 
provide a standard against which the quantities of all 
identifiers would be measured. The quantities of the 
identifiers could be measured by gas chromatography or 
20 HPLC with a variety of detection methods. In the case of 
HPLC, quantities could be conveniently measured by 
scintillation counting if the identifiers were 
radioactively labeled by different quantities of a 
radionuclide such as tritium (^H) . It would be particularly 
25 convenient to carry out the c[uantitation by measuring the 
^H-to-"C ratio, thus using "C as a standard. In this way, 
as many as ten quantities of could be distinguished to 
create a base 10 or decimal code (M=10) which could encode 
enormous amounts of information with very few identifiers . 
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Products and Synthetic Strategies 

For the most part, the products of the method of this 
invention will be organic compounds where the serial 
synthesis will involve the addition or removal of chemical 
5 units, reactions involving the modification or 
introduction of one or more functionalities, ring 
openings, ring closings, etc. Chemical units can take 
many forms, both naturally-occurring and synthetic, such 
as nucleophiles, electrophiles, dienes, alkylating or 
10 acylating agents, diamines, nucleotides, amino acids, 
sugars, lipids, or derivatives thereof, organic monomers, 
synthons, and combinations thereof. Alternatively, 
reactions may be involved which result in alky lat ion, 
acylation, nitration, halogenation, oxidation, reduction, 
15 hydrolysis, substitution, elimination, addition, and the 
like. This process can produce non-oligomers, oligomers, 
or combinations thereof in extremely small amounts, where 
the reaction history, and composition in appropriate 
cases, can be defined by the present tags. Non-oligomers 
20 include a wide variety of organic molecules, e.g. 
heterocyclics, aromatics, alicyclics, aliphatics and 
combinations thereof, comprising steroids, antibiotics, 
enzyme inhibitors, ligands, hormones, drugs, alkaloids, 
opioids, terpenes, porphyrins, toxins, catalysts, as well 
25 as combinations thereof. Oligomers include oligopeptides, 
oligonucleotides , oligosaccharides, polylipids, 
polyesters, polyamides, polyurethanes, polyureas, 
polyethers, poly (phosphorus derivatives) e.g. phosphates, 
phosphonates, phosphoramides , phosphonamides, phosphites, 
30 phosphinamides, etc., poly (sulfur derivatives) e.g. 
sulfones, sulfonates, sulfites, sulfonamides, 
sulfenamides, etc., where for the phosphorous and sulfur 
derivatives the indicated heteroatom for the most part 
will be bonded to C, H, N, 0 or S, and combinations 
35 thereof. 
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Reactions may involve modifications at a variety of random 
sites of a central core molecular structure or 
modifications at a specific site. For example, one may 
brominate a polycyclic compound, where bromination may 
5 occur at a plurality of sites or use a brominating agent 
which will be specific for a particular site, e.g., N- 
bromosuccinimide . For the most part, reactions will 
involve single sites or equivalent sites, for example, one 
of two hydroxyl groups of a glycol. 

10 

For the most part, the subject synthesis will have at 
least two stages where other than bifunctipnal compounds 
are attached using the same linking functionality, e.g. 
amino acids and amide bonds, nucleotides and phosphate 
15 ester bonds, or mimetic compounds thereof, e.g., aminoiso- 
cyanates and urea bonds. 

The methods of the invention permit variation in reaction 
at each stage, depending on the choice of agents and 

20 conditions involved. Thus, for amino acids, one may have 
up to 20 amino acids involved using the common naturally- 
encoded amino acids and a much wider choice, if one wishes 
to use other amino acids, such as D-amino acids, amino 
acids having the amino group at other than the of-position, 

25 amino acids having different sxibstituents on the side 
chain or substituents on the amino group, and the like. 
For the different nucleic acids, there will usually be up 
to 4 natural nucleic acids used for either DNA or RNA and 
a much larger number is available if one does not choose 

30 to use those particular nucleic acids. For the sugars and 
lipids, there are a very large number of different 
compounds, which compounds may be further increased by 
various substitutions, where all of these compounds may be 
used in the synthesis. For individual organic compounds 

35 the choice may be astronomically large. In addition, one 
may have mimetic analogs, where ureas, urethanes, 
carbonylmethylene groups, and the like may siibstitute for 
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the peptide linkage; various organic and inorganic groups 
may substitute for the phosphate linkage; and nitrogen or 
sulfur may substitute for oxygen in an ether linkage or 
vice versa. 

5 

The synthetic strategies will vary with the nature of the 
group of products one wishes to produce. Thus, the 
strategy must take into consideration the ability to 
stage-wise change the nature of the product, while 

10 allowing for retention of the results of the previous 
stages and anticipating needs for the future stages. 
Where the various units are of the same family, such as 
nucleotides, amino acids and sugars, the synthetic 
strategies are relatively well -established and frequently 

15 conventional chemistry will be available. Thus, for 
nucleotides, phosphoramidite or phosphite chemistries may 
be employed; for oligopeptides, Fmoc or Boc chemistries 
may be employed where conventional protective groups are 
used; for sugars, the strategies may be less conventional, 

20 but a large number of protective groups, reactive 
functionalities, and conditions have been established for 
the synthesis of polysaccharides. For other types of 
chemistries, one will look to the nature of the individual 
unit and either synthetic opportunities will be known or 

25 will be devised, as appropriate. 

In some instances, one may wish to have the same or 
different blocks introduced at the same or different 
stages. For example, one may wish to have a common 

30 peptide functional unit, e.g. the fibronectin binding unit 
(RGDS) , a polysaccharide, e.g. Le"", an organic group, e.g. 
a lactam, lactone, benzene ring, olefin, glycol, 
thioether, etc. introduced during the synthesis. In this 
mcinner one may achieve a molecular context into which the 

35 variation is introduced. These situations may involve 
only a few stages having the plurality of choices, where 
a large number of products are produced in relation to a 
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particular functional entity. This could have particular 
application where one is interested in a large number of 
derivatives related to a core molecule or unit known to 
have a characteristic of interest. 

5 

In developing synthetic strategies, one can provide for 
batch synthesis of a few compounds which would be prepared 
during the course of the combinatorial synthesis- By 
taking extreme examples, for example, syntheses which 

10 might involve steric hindrance, charge and/or dipole 
interactions, alternative reaction pathways, or the like, 
one can optimize conditions to provide for enhanced yields 
of compounds which might not otherwise be formed or be 
formed only in low yield. In this manner, one may allow 

15 for a variety of reaction conditions during the 
combinatorial synthesis, involving differences in solvent, 
temperatures, times, concentrations, and the like. 
Furthermore, one may use the batch syntheses, which will 
provide much higher concentrations of particular products 

20 than the combinatorial synthesis, to develop assays to 
characterize the activity of the compounds. 

Supports: Attachment and Detachment 

The synthetic protocol requires that one provide for a 
25 plurality of different reactions involving different 
reactants resulting in a plurality of different 
intermediates at each stage of the synthesis. While other 
techniques are available, this can be achieved most 
conveniently by employing small definable solid . 
30 substrates, commercially available as beads, which can be 
readily mixed, separated, and serve as a solid substrate 
for the sequential synthesis. The solid substrates may be 
solid, porous, deformable or hard, and have any convenient 
structure and shape. In some instances, magnetic or 
35 fluorescent beads may be useful. The beads will generally 
be at least 10-2000 /im, usually at least 20-500 /xm, more 
usually at least 50-250 /xm in diameter. 
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Any convenient composition can be used for the particles 
or beads, which bead composition will maintain its 
mechanical integrity during the various process stages, 
can be functionalized, has functional groups or allows for 
5 reaction with an active species, allows for the serial 
synthesis as well as attachment of the identifiers, can be 
readily mixed and separated, and will allow for convenient 
detachment of the tags and products. Beads which may be 
employed include cellulose beads, controlled-pore glass 

10 beads, silica gel, polystyrene beads, particularly 
polystyrene beads cross -linked with divinylbenzene, 
grafted co-polymer beads such as 
polyethyleneglycol/polystyrene , polyacrylamide beads , 
latex beads, dimethylacrylamide beads, particularly cross - 

15 linked with N,N' -bis-acryloyl ethylene diamine and 
comprising N-t.-butoxycarbonyl-iS-alanyl-N' -acryloyl 
hexamethylene diamine, composites, such as glass particles 
coated with a hydrophobic polymer such as cross- linked 
polystyrene or a fluorinated ethylene polymer to which is 

20 grafted linear polystyrene; and the like. General reviews 
of useful solid supports (particles) that include a 
covalently- linked reactive functionality may be found in 
Atherton, et al., Prospectives in Peptide Chemistry , 
Karger, 101-117 (1981); Amamath, etal,, Chem. Rev. 

25 77:183-217 (1977); and Fridkin, The Peptides . Vol. 2, 
Chapter 3, Academic Press, Inc., (1979), pp. 333-363. 

Depending upon the nature of the synthetic procedure or 
the assay of the final product, one or another bead may be 
30 more or less desirable. While beads are especially 
convenient, other solid supports may also find use, such 
as capillaries, hollow fibers, needles, solid fibers, 
etc., where the size of the solid support allows for the 
desired variation in reaction histories. 

35 

Depending upon the nature of the synthesis, the beads may 
be functionalized in a variety of ways to allow for 
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attachment of the initial reactant. These may be linked 
through a non- labile linkage such as an ester bond, amide 
bond, amine bond,- ether bond, or through a sulfur, 
silicon, or carbon atom, depending upon whether one wishes 
5 to be able to remove the product from the bead. 
Conveniently, the bond to the bead may be permanent, but 
a linker between the bead and the product may be provided 
which is cleavable such as exemplified in Table 1. Two or 
more different linkages may be employed to allow for 
10 differential release of tags and/or products. 

Depending upon the nature of the linking group boiind to 
the particle, reactive functionalities on the bead may not 
be necessary where the manner of linking allows for 
15 insertion into single or double bonds, such as is 
available with carbenes and nitrenes or other highly- 
reactive species. In this case, the cleavable linkage 
will be provided in the linking group which joins the 
product or the tag to the bead. 

20 

Desirably, when the product is permanently attached, the 
link to the bead will be extended, so that the bead will 
not sterically interfere with the binding of the product 
during screening. Various links may be employed, 
25 particular hydrophilic links, such as polyethyleneoxy, 
saccharide, polyol, esters, amides, combinations thereof, 
and the like. 

Functionalities present on the bead may include hydroxy, 
30 carboxy, iminohalide, amino, thio, active halogen (Cl or 
Br) or pseudohalogen (e.g., -CF3, -CN, etc.), carbonyl, 
silyl, tosyl, mesylates, brosylates, triflates or the 
like. In selecting the functionality, some consideration 
should be given to the fact that the identifiers will 
35 usually also become bound to the bead. Consideration will 
include whether the same or a different functionality 
should be associated with the product and the identifier, 
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as well as whether the two functionalities will be 
compatible with the product or identifier attachment and 
tag detachment stages, as appropriate. Different linking 
groups may be employed for the product, so that a specific 
5 quantity of the product may be selectively released- In 
some instances the particle may have protected 
functionalities which may be partially or wholly 
deprotected prior to each stage, and in the latter case, 
reprotected. For example, amino may be protected with a 
10 carbobenzoxy group as in polypeptide synthesis, hydroxy 
with a benzyl ether, etc. 

Where detachment of the product is desired, there are 
numerous functionalities and reactants which may be used. 

15 Conveniently, ethers may be used, where substituted benzyl 
ether or derivatives thereof, e.g. benzhydryl ether, 
indanyl ether, etc. may be cleaved by acidic or mild 
reductive conditions. Alternatively, one may employ 
jS-elimination, where a mild base may serve to release the 

20 product. Acetals, including the thio analogs thereof, may 
be employed, where mild acid, particularly in the presence 
of a capturing carbonyl compoiind, may serve. By combining 
formaldehyde, HCl and an alcohol moiety, an a-chloroether 
is formed. This may then be coupled with an hydroxy 

25 functionality on the bead to form the acetal. Various 
photolabile linkages may be employed, such as 
o-nitrobenzyl, 7-nitroindanyl, 2-nitrobenzhydryl ethers or 
esters, etc. Esters and amides may se3rve as linkers, 
where half-acid esters or amides are formed, particularly 

30 with cyclic anhydrides, followed by reaction with hydroxyl 
or amino functionalities on the bead, using a coupling 
agent such as a carbodiimide . Peptides may be used as 
linkers, where the secjuence is subject to enzymatic 
hydrolysis, particularly where the enzyme recognizes a 

35 specific sequence. Carbonates and carbamates may be 
prepared using carbonic acid derivatives, e.g. phosgene, 
carbonyl diimidazole, etc. and a mild base. The link may 
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be cleaved using acid, base or a strong reductant, e.g., 
LiAlH4, particularly for the carbonate esters. For a list 
of cleavable linkages, see, for example, Greene and Wuts, 
Protective Groups in Organic Synthesis, 2nd ed. Wiley, 
5 1991. The versatility of the various systems that have 
been developed allows for broad variation in the 
conditions for attachment of products and identifiers and 
differential detachment of products and tags, as desired. 

10 The following table indicates various illustrative linking 
units ( i.e. , in Formula I) and the manner in which they 
may be cleaved: 
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Table 1. Various illustrative linking units and the 
manner in which they may be cleaved. 



5 


Linking Group 


Cleavage Reagent 




silyl 


fluoride or acid 




A 


hv 




B 


Ce (NH4)2(N03)6 




-NCO2 (L) * 


OH", K\ or LiAlH4 


10 


C 


O3, Os04/I04", or KMnO^ 




D 


1) O2 or Br2, MeOH 

2) H3O* 




-Ol- {Li) 


oxxuauxon/ ri , ^^2, v^-^2' 
etc. 




Em 






T? 
r 






r* 
\j 


£* Kjjm n 




•LI 




20 


X = keto . ester . amide * 
NO2, sulfide, sulfoxide, 
sulfone, and related 
electron withdrawing 
groups 






I 


H3O* or reduction (e.g. 
Li/NH3) 




J 


(03P)3RhCl(H) 


25 


K 


Li,Mg, or BuLi 




M 


Hg*2 




N 


Zn or Mg 


30 


X = halogen or 
pseudohalogen 






0 


oxidation (e.g. Pb(0Ac)4 
or HglOe) 




P 


base 


35 


X = electron withdrawing 
group 





*(L) shows the point of attachment of the tag or product. 
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(L) is the tag or product either directly bonded to the 
indicated atom or indirectly bonded through a linking 
group such as C{0)0, which linking group may provide a 
convenient functionality. 
5 R is H or lower alkyl. 

Linker . 

The choice of linker for the ligand will be part of the 
synthetic strategy, since the linking group may result in 

10 a residual functionality on the product. It is feasible 
to further modify the product after detachment from the 
bead. In designing the synthetic strategy, one can use a 
functionality to be retained in the product as the point 
of attachment for the linking group. Alternatively, when 

15 permitted by the nature of the product, one could use a 
cleavage or detachment method which removes the linking 
functionality, e.g., an arylthioether or silyl with a 
metal hydride or acid. Since in many cases the synthetic 
strategy will be able to include a f unctionalized site for 

20 linking, the functionality can be taken advantage of in 
choosing the linking group. In some instances it may be 
desirable to have different functionalities at the site of 
linking the product to the support, which may necessitate 
using different modes of linking, which modes must 

25 accommodate either the same detachment method or different 
detachment methods which may be carried out concurrently 
or consecutively, e.g., irradiation with light and acid 
hydrolysis. 

30 Of particular interest for binding the identifiers to the 
particle are carbenes and nitrenes which can insert 
between a carbon and hydrogen atom to form a covalent 
bond, or into an olefinic bond to form a cyclopropane (in 
the case of carbene) or an aziridine (in the case of 

35 nitrene) . 
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With carbene or nitrene linking groups various substituted 
benzenes may be used, where the benzene is substituted 
with a group capable of providing a carbene: CHN2/ COCHNj, 
SO2CHN2; or nitrene: N3, NO2/ NO, SO2N3. The carbenes may 
5 be generated from diazoalkane derivatives by photolysis, 
thermolysis, or by treatment with low valent transition 
metal species, e.g., Rh(0Ac)2- The nitrene may be 
generated by photolysis or thermolysis from azides; and 
from nitro, nitroso and azides by using tervalent 
10 phosphorus compounds or low valent transition metals. 

A group of linker moieties (F^-F^-) of particular interest 
include 2-nitro-4 -carboxybenzyloxy , 2 -nitro- 4 - 
diazoacetylbenzyloxy, 4 or 5 azidomethylcarbonyl-2- 
15 methoxyphenoxy, and 2-methoxy-4, or 5-carboxyphenoxy 
moieties . 

Illustrative con^jounds where T represents the tag, Z 
represents a carbene or nitrene precursor or a carboxy 

20 group, and R is H or lower alkyl are as follows. For 
photochemical tag detachment (e.g., with ultraviolet light 
at about 350 nm) : T 3-Z-2-nitrobenzyl ether, T 4-Z-2- 
nitrobenzyl ether, T 5-Z-2-nitrobenzyl ether, T 6-Z-2- 
nitrobenzyl ether, T 2-Z-4-nitrobenzyl ether, T 3-Z-4- 

25 nitrobenzyl ether, T 3-Z-2-nitrobenzyl carbonate, T 4-Z-2- 
nitrobenzyl carbonate, T 5 -Z- 2 -nitrobenzyl carbonate, T 6- 
Z-2 -nitrobenzyl carbonate, T 2 -Z-4 -nitrobenzyl carbonate, 
and T 3-Z-4-nitrobenzyl carbonate. For oxidative 

detachment (e.g., using eerie ammonium nitrate): l-OT-2- 

30 OR-3-Z-benzene, l-0T-2-0R-4-Z-benzene, 1-OT-2-OR-5-Z- 
benzene, l-OT-2-OR-6-Z-benzene, l-OT-4-OR-2-Z-benzene, and 
l-OT-4-OR-3-Z-benzene. For reductive or alkylative 
detachment (e.g. with lithium/ammonia or methyl iodide) : 
T (2-Z-phenyl) thioether, T ( 3 -Z -phenyl ) thioether, and T 

35 (4 -Z-phenyl) thioether . For desilylative detachment (e.g., 
using tetrabutyl ammonium fluoride or acid) : T dialkyl- 
(2 -Z-phenyl) silyl ether, T dialkyl- (3 -Z-phenyl) silyl 
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ether, T dialkyl- {4-Z-phenyl) silyl ether, T-dialkyl- (2-Z- 
phenyl) silane, T-dialkyl- (3-Z-phenyl) silane, and T- 
dialkyl- (4-Z-phenyl) silane. 

5 Combinatorial Synthesis 

The synthesis will usually involve stages involving at 
least 2 choices, frequently at least 4 choices, and may 
involve 10 choices or more. Generally, the number of 
choices per stage will not exceed about 100, more usually 

10 not exceed about 50. The number of stages will usually be 
at least about 3, more usually at least about 4, 
frequently at least 5, and not more than about 30, more 
usually not more than about 25, preferably not more than 
about 20, more preferably not more than about 10, 

15 frequently not more than about 8 • 

The number of choices and stages will usually result in at 
least a number of compoiinds which allows for a sufficient 
variety to provide a reasonable likelihood that at least 

20 one compound will have the characteristic of interest. 
The subject methodology allows for producing greater than 
25,000 compounds, usually greater than 50,000 compounds, 
preferably greater than 200,000 compounds, and a million 
or more may be produced. This will usually mean at least 

25 20 compounds but may be 10^ or more. 

In some syntheses, a stage may only involve one or two 
choices, but this situation will usually be limited in 
relation to the number of compounds one wishes to produce 

30 and the particular synthetic strategy. In many of the 
strategies, the restricted number of choices, i.e., fewer 
than 5 choices, more usually 2 or fewer choices, will be 
limited to the greater of 40% of the total number of 
stages or about 2 stages in the sequential synthesis, more 

35 usually limited to 20% of the total number of stages. 
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Reaction Procedure. 

In carrying out the synthesis, one may initially begin 
with a number of beads, usually at least 10^, more usually 
10^, and desirably at least 10^, while generally not 
5 exceeding 10", more usually not exceeding at least 10^**, 
Depending upon the number of choices in the first stage, 
one will divide up the particles accordingly into as many 
containers. One can use microtiter well plates, 
individual containers, columns, gels, Terasaki plates, 
10 flasks, Merrifield synthesis vessels, etc. The particles 
will usually be divided up into groups of at least one 
particle each, usually a plurality of particles, generally 
1000 or more, and may be 10^ or more depending on the total 
number of particles and choices involved in the stage. 

15 

One would then add the appropriate agents to each of the 
individual containers to process them in stages and add 
the identifiers which encode the reagent and stage. Each 
stage would provide the desired reaction. Once the 

20 reaction (s) is complete, one may wish to wash the beads 
free of any reagent, followed by combining all of the 
beads into a single mixture and then separating the beads 
according to the number of choices for the next stage. 
This procedure of dividing beads, followed by the tagging 

25 and synthesis stages (or vice versa) , and then recombining 
beads is iterated until the combinatorial synthesis is 
completed. 

In some instances, the same reaction may be carried out in 
30 2 or more containers to enhance the proportion of product 
having a particular reaction at a particular stage as 
compared to the other choices. In other instances, one or 
more of the stages may involve a portion of the beads 
being set aside and undergoing no reaction, so as to 
35 enhance the variability associated with the final product. 
In other situations, batches may be taken along different 
synthetic pathways. 
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In order to record or encode the synthesis history on the 
beads, at each stage one would tag the beads associated 
with each choice and stage with their own unique 
combination of identifiers. Alternately one may use a 
5 single tag to record or encode this synthesis history. 
Depending on the chemistries involved, this tagging may be 
done prior to, after, or concomitantly with the reactions 
which comprise each choice. Further, as a control, sample 
beads may be picked at any stage and a portion of their 
10 tags cleaved off and decoded to verify that the correct 
tags are bound to the sample beads. 

As indicated previously, in some instances, portions of 
the particles will be segregated into subsets, where each 

15 of the subsets would then undergo a different reaction 
series. At any time, the portions may be recombined into 
a single mixture for subsequent reaction. For example, if 
at one stage one introduces unsaturation, one could 
provide two subsets, where in one subset the unsaturation 

20 is reduced, while in the other subset the unsaturation is 
epoxidized. These two subsets could then be subjected to 
different reaction series. 

After synthesis of the products is complete, they are 
25 screened for a desired property either after detachment of 
the ligand from the bead or while still attached. In the 
latter case, beads, for example, may be incubated in 
aqueous buffer with mouse monoclonal antibody Y. After 
incubation and washing, the beads are incubated with 
30 alkaline phosphatase -conjugated rabbit (or goat) 
polyclonal antibody directed against mouse antibodies. 
Using a fluorescent precipitation developing reagent, 
fluorescent beads with attached monoclonal antibody are 
identified and manually separated from the majority of 
35 clear, unstained beads. Alternatively, the fluorescent 
beads can be separated using a fluorescence-activated cell 
sorter, so long as the tags are retained on the bead under 
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the conditions of sorting. Each selected fluorescent bead 
is subjected to a means for releasing at least some 
portion of all of the tags from the bead. 

5 In instances where the synthesis does not involve the 
stagewise addition of like units, or where reaction 
byproducts are formed, there may be instances where there 
will be a plurality of compounds on a single bead or the 
structure of the active compound cannot be known from its 
10 reaction history. In rrcordance with the subject 
invention, by knowing the reaction history, one may repeat 
the synthesis on a larger scale so as to obtain a 
sufficient amount of the product (s) to isolate the 
product (s) and structurally identify the active compound. 

15 

The subject methodology may be illustrated using various 
reaction sequences. For example, barbiturates may be 
prepared by combining an aldehyde or ketone with an 
acetate ester to prepare a crotonate under Claisen 

20 conditions to provide an unsubstituted to tetrasubstituted 
crotonate. The crotonate may then be combined with a 
second acetate under Michael conditions, whereby a 
glutarate may be obtained having up to 6 substituents. 
The glutarate may then re combined with ammonia or 

25 monosubstituted amine zo provide the barbiturate . By 
varying the aldehydes and ketones, the acetates and the 
amines, a great variety of barbiturates may be obtained. 
Where functionalities are present on one or more of the 
substituents, such as amino, carboxy, hydroxy, thiol, and 

30 the like, these groups may be protected or modified as 
desired. 

In another example described by Bunin and Ellman, J". Am. 
Chem. Soc, 114, 10997 (1992), benzodiazepines are 
35 produced. One begins the synthesis with different amino 
protected substituted 2-aminobenzophenones bound to 
individual particles through, for example, a 4'-oxy group. 
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To each different group of particles in different vessels, 
after deprotection, are added a different Fmoc -protected 
ff-amino acid, either naturally occurring or synthetic, 
under conditions where a peptide bond is formed. After 
5 deprotection, internal cyclization is caused, followed by 
alkylation on nitrogen with an alkylating agent. In only 
-three stages, a very large number of benzodiazepines are 
prepared and the libraries may be screened for 
tranquilizing or other activity, 

10 

A wide variety of drug analogs may be produced, such as 

analogs of antihypertensive agents, e.g. enalapril; 

)3-blockers, e.g. propanolol; antiulcer drugs (Hj-receptor 

antagonists) e.g. cimetidine and ranitidine; antifungal 
15 agents (cholesterol-demethylase inhibitors) e.g. 

isoconazole; anxiolytics, e.g. diazepam; analgesics, e.g. 

aspirin, phenacetamide, and fentanyl; antibiotics, e.g. 

vancomycin, penicillin and cephalosporin; 

antiinflammatories, e.g. cortisone; contraceptives, e.g. 
20 progestins; abortif acients, e.g. RU-456; antihistamines, 

e.g. chlorphenamine ; antitussives, e.g. codeine; 

sedatives, e.g. barbitol;, etc. 

An illustrative synthesis of cimetidine analogs could 
25 involve hydroxymethylsubstituted histidines, and related 
heterocycles, where the remaining carbon atoms or nitrogen 
atoms could be further substituted or unsubstituted, 
a,(i)-aminoalkylthiols, and substituted thioamidine esters, 
where the groups on nitrogen could be varied, such as 
30 nitro, cyano, hydroxy, alkyl, combinations thereof, and 
the like. 

Identifier 

The identifiers of this invention may be represented by 
35 the Formula I: 
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where F^-F^ is a linker which allows for attachment to a 
support and detachment of the tag from a support; and 

C-E-C'is the tag which is capable of detection and 
distinguishability; 
5 E is a tag component which (a) allows for detection, 

such as an electrophoric group which can be analyzed by 
gas chromatography or mass spectroscopy or (b) allows for 
detection and for separation; 

C and C' are tag components which allow for 
10 individually distinguishing one tag from all other tags, 
usually allowing for separation as a result of variable 
length or substitution, for example, varying the 
chromatographic retention time or the mass spectroscopy 
ratio m/e; 

15 F^ is a linking component capable of being selectively 

cleaved to release the tag component; and 

F^ is a functional group which provided for attachment 
to the support; or 

F^ is a bond when F^ is a cleavable group such as OH 
20 or carboxy. 

Although the identifiers of Formula I are typically added 
at each appropriate stage and choice during the 
combinatorial synthesis, the portion E can be added at the 

25 end of the syntheses either before or after cleavage 
(preferably photochemically or oxidatively) from the 
substrate. Specifically, where C contains OH, NHR*, or SH, 
E can be attached to C prior to cleavage. Alternatively, 
if E is attached after cleavage, the point of attachment 

30 at C may be where F^ was attached. This is exemplified in 
the scheme on the following page: 



\ 
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^ N(CH3)2 

20 

where S = siibstrate and 



n = 1-40 

Attachment of the identifier to the siibstrate can be 
represented as follows: 

25 F^-F^-C-E-C' + S > S-F^' -F^-C-E-C' 

where F^' -F^-C-E-C' represents the identifier residue 
attached to the sxibstrate. For example, when the bead is 
functionalized with an aminomethyl group and F^ is CO2H, 
then F^' is -C(0)-; when the bead contains an unsaturated 
30 bond and F^ is NaCH-CiO)-, then F^' is =CH-C(0)- or 
-CH2-C(0) 

Of particular interest for use as identifiers are 
compounds of Formula I of the Formula la: 

F^-f2-(C(E-C') la 
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wherein : 

is COjH, CHjX, NR^RS C(0)RS OH, CHN2, SH, C(0)CHN2, 
SCOaCl, SCOjCHNj, N3, NO2, NO, S(0)2N3, OC(0)X, C(0)X, 
NCO, or NCS; 
5 p2 is 
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— C(OH)I^^ C(R^2^ — • — C(OH)R — C(CH2X)R— , 



— C(OH)R— C(R^2 — C(X)R^ — , — C(OH)(CH2CH2X) — - 

, OR A — 
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A is -0-, ^0C(0)0-, -0C(0)-, or -NHC(O)-; 

C is a bond; Ci-Cjo alkylene optionally siibstituted by 1-40 
F, CI, Br, Ci-Cfi alkoxy, NR*R\ 0R\ or NHRS or 
-[(C{R^)2)„-Y-Z-Y-(C(R^)2)„Y-Z-Y]p-; with the proviso 
5 that the maximum number of carbon atoms in C+C' is 

preferably 20; 

CI is H; F; CI; C^-Cjo alkyl optionally substituted by 

1-40 F, CI, Br, Ci-Cg alkoxy, NR^R*, 0R\ or NHR\ or 
- [(C(R^)2)„-Y-Z-Y-(C(R^)2)„Y-Z-Y]p-; 
10 E is Ci-Cio alkyl substituted by 1-20 F, Cl or Br; or 

Q-aryl wherein the aryl is substituted by 1-7 F, Cl, 
NO2, S02R^, or substituted phenyl wherein the 
substituent is 1-5 F, Cl, NOj, or SOaR^; 

E-C' may be -H, -OH, or amino; 
15 R^ is H or C^-Cg alkyl; 

R^ is C=0, C(0)0, C(0)NRS S, SO, or SO2; 

R^ is H or Ci-Cfi alkyl; 

R^ is Ci-Cfi alkyl; 

a is 1-5; 
20 b is 1-3; 

m and n is each 0-20; 

p is 1-7; 

Q is a bond, O, S, NHR\ 0=0, -C{0)NRS -NR^C(O)-, -C(0)0-, 
or -0C(0) -; 

25 X is a leaving group such as Br, Cl, triflate, mesylate, 
tosylate, or 0C(0)0R^; 
Y is a bond, O, S, or NHR*; 

Z is a bond; phenylene optionally substituted by 1-4 F, 
Cl, Br, Ci-Cg alkyl, C^-Cg alkoxy, C^-Ce alkyl substituted 
30 by 1-13 F, Cl, or Ci-Cg alkyloxy sxibstituted by 1-13 

F, Cl, or Br; (C (R*) 3) 1-20/ (CFjji-so/ with the proviso 
that when Z is a bond one of its adjacent Y's is also 
a bond; and 

aryl is a mono- or bi -cyclic aromatic ring containing up 
35 to 10 carbon atoms and up to 2 heteroatoms selected 

from O, S, and N. 
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10 



In the definitions of in Formula la, the left-hand 
bond as depicted attaches to F^. 

Also useful as identifiers are compounds of the 
Formula la' ; 

F^- (C(E-C' )a la' 

wherein : 

F^ is OH or COOH; and 

the remaining definitions are as in Formula la. 
Preferred compounds of Formula la are those wherein; 
F^ is 



15 



F^ is 



CO2H. OH. CHN2> CtOCHK^. C(0)X. NCS. or CHgX; 



A — 




CHgA — 




CH2A — 



20 




OR 





25 




or 




35 



C and C' is each independently Ci-Cjo alkylene or Ci-Cjo 
alkyl, respectively, xinsubstituted or substituted by 1-40 
P or CI, or [0-(CH2)2.3]p; 

E is Ci-Cio alkyl substituted by 1-20 F or Cl; Q-aryl where 
aryl is a bi-cyclic aromatic ring substituted by 1-7 
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F or Cl; or Q-phenyl substituted by 1-5 F, CI, NOj, or 
SOgR^; and 

Q is a bond, O, -NR^C(O)-, or -0C(0)-. 

5 Preferred compounds of Formula la are those wherein 
-C(E-C')a is represented by - (CHj) o-is" (CFj) i.^gF, 

- {CH2)o.i5- (CCl2)i.,5Cl, - (CHjCHj-Oi.s-Ar, 

- (CHaCHjCHaOli.s-Ar, or - (CHj) l-x2•-0-Ar; 
wherein Ar is pentafluoro" pentachloro- , or 

pentabromophenyl , 2,3,5, 6- tetraf luoro-4 (2,3,4,5,6- 
pent af luorophenyl ) phenyl , 2,4,6- trichlorophenyl , 
2,4, 5 -trichlorophenyl, 2, 6-dichloro-4-f luorophenyl, 
or 2,3,5, 6 -tetraf luorophenyl . 

A preferred embodiment of Formula la is wherein F^ is COOH, 
CHN2, C(0)CHN2, 8(02) CHN2, COCl, OH, SH, CHjX, or NHR^; 



10 



15 
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wherein is 



* A — 




A OR® 



— A 



wherein A is -0- or -0C(0)0-; 

5 C(E-C')a is - (CRS)i,i5- (O)o-i-Ar, - (CR%) o-is" (CFj) 1-15-F, 
• - (CR%) o-is- (CF2) 1-15- (CR%) 0-15-H, or 
- (CH2) 1-20- ( (O) 0-1- (CHj) 1.1^) 0-24- (CH2) 0-24- (O) 0-1-Ar, 

wherein Ar is pentaf luoro- , pentachloro-, or 
10 pentabromophenyl , 2,3,5,6- tetraf luoro-4 (2,3,4,5,6- 

pentaf luorophenyl ) phenyl , 2,4,6- trichlorophenyl , 

2,4, 5 "trichlorophenyl , 2 , 6 -dichloro-4 -f luorophenyl , 
or 2,3,5, 6 -tetraf luorophenyl . 
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Other preferred compounds of Formula la are represented by 
the formulae: 



5 
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OMe 



5 




10 



(CH2)i,i5 0 Ar 



^ and 



25 




wherein Ar is pentafluoro-, pentachloro- , or 
30 pentabromophenyl , 2,3,5, 6-tetraf luoro-4 (2,3,4,5,6- 

pentafluorophenyl) phenyl , 2,4, 6-trichlorophenyl , 
2,4, 5 - trichlorophenyl , 2 , 6 -dichloro-4 - f luorophenyl , 
or 2,3,5,6- tetraf luorophenyl . 

Other preferred compounds of Formula la are those 

35 

wherein E-C' is H, OH, or NHj. Such compounds are 
particularly useful for reaction with an E at the end of 
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the combinatorial synthesis, especially with an E 
detectable by fluorescence or electron capture, such as 
dansyl chloride or polyhalobenzoylhalide. 

5 Another embodiment of the invention is represented by the 
following compound: 




0-20 



The compounds of Formula I can be prepared according to 
10 the following exemplary schemes or other means known to 
those skilled in the art. 

SCHEME 1 

Identifier Tag Preparation 

X 

lEq,CS2C03. DMF 

^^^^ HO^^'^^^'^^Br — ; " EO^^'^^^^^^O-'' 



X 

ELECTROPHORIC PHEHOL, ArOH 
X = CI OR F. 



wo 95/28640 ' PCT/USJW04683 

-46- 

SCHKME 2 

Identifiers With Photolytic Cleavacre Linkers 

5 



10 




PYRIDINE. CH2CI2 

0 
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SCHEME 3 

Identifiers With Oxidative Cleavage Linkers 



MeO 



OHe 



HO 0 



PPhj, DEAD. 
TOLUENE 



tteO 



cue 



1. LiOE. THF/IATER. /^^'^^W^^Ai 
BEFLUX 



CI. 



2. SOCLg. 



TOLDEVE REFLUX 



OMe 



EtgO/CHgCl; 



2"2 
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SCHEME 4 

Identifiers With Alternative Oxidative Release Linkers 



H 




OMe 



PPI13. DEAD. 



lOLUEKE 



H. 




OMe 



l.rniO^.ACETOHE/YHTER 



2. oxALOYi mmi, 

cat. DMF. CHjClj 



CI 




OMe 



CHoN. 



2"2 



Et20/CH2Cl2 




OMe 
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SCHEHS 5 

E-C' Tags 



COgH 



10 




I.BH3. THF 



2.C52CO3. Br(CH2)^CH3 




15 



OH 

SCHEME 6 

Identifiers With Photolvtic Cleavage Linkers 



1. COCL2. TOLUENE 
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SCHEME 7 

Identifiers With Oxidative Cleavage Linkers 
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The identifier may comprise one or a plurality of 
identical tags. The identifiers will be individual 
chemical compound (s) which may be distinguished one from 
the other and will uniquely identify different choipes and 
5 stages. In this manner, very large combinatorial 
libraries may be prepared with a relatively small number 
of identifiers, usually fewer than 50 tags. 
During each stage, a combination of identifiers will be 
added, which defines the stage and choice. Each 

10 identifier will be bound, either covalently or non- 
covalently to the bead or to the product, usually the 
bead. Combinations of identifiers are used to provide a 
binary or other code at each stage, whereby the choice and 
stage may be defined. The combination of identifiers may 

15 include zero or only one identifier. 

Tags 

So far as the tags (C-E-C' ) are concerned, the tags which 
20 are employed will be characterized as follows: by being 
removable from the bead by means depending on or F^, 
preferably by hydrolysis, photolysis or oxidation; by 
being individually dif f erentiable, usually separable; by 
being stable under the synthetic conditions; by encoding 
25 both stage and choice so as to uniquely define the choice 
of agent used at each stage in the synthesis; desirably, 
there should be an easy way to identify the various tags 
with readily-available equipment which does not require 
sophisticated technical capabilities to operate; they 
30 should be relatively economical and provide a strong 
signal based on a relatively few molecules; and the tags 
should provide sufficient sensitivity to permit 
distinguishing the tags from the other components which 
may be present during the tag determinations. 

35 

The tags may be structurally related or unrelated, as in 
a homologous series, repetitive functional groups, related 
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members of the Periodic Chart, different isotopes, 
combinations thereof, or the like. The tags may be used 
as elements of a binary code, so that one tag can define 
two choices, two tags can define four choices, three tags 
5 can define eight choices, five tags can define thirty- two 
choices, etc. Thus, at each stage of the synthesis, a 
relatively small number of tags can designate a much 
larger number of choices. The tags comprising the 
identifiers for each stage may or may not be related to 
10 other stages. Each tag for any combinatorial synthesis 
must allow for being distinguished from all other tags. 
In this manner, very large combinatorial libraries may be 
prepared with a relatively small number of tags, usually 
fewer than 60 tags, more usually fewer than about 50 tags. 

15 

For each bead, there will usually' be at least 0.01 
femtomol, more usually 0.001-50 pmol, of each tag, 
although lesser or greater amounts may be used in special 
circumstances. The amount of product may also be at least 

20 in the same range and up to at least 10* or more greater, 
usually being at least 0.01 pmol, more usually at least 
1-0 pmol and generally not more than about 10 nmol. 
Depending upon the number of beads, the number of stages 
and the number of choices per stage, the number of 

25 products produced will usually exceed 10^, more usually 
10\ and may exceed 10", usually not exceeding about 10% 
preferably being in the range of about 10* to 10% more 
usually 10^ to 10% 

30 The tags will, for the most part, be organic molecules. 
Each tag will usually have fewer than about 100 atoms, 
more usually fewer than about 80 atoms, generally fewer 
than about €0 atoms, other than hydrogen, excluding a 
linking moiety which would not be retained on release of 

35 the tag from the bead. The linking moiety may be of any 
size, usually being fewer than about 30 atoms, more 
usually fewer than 20 atoms, other than hydrogen. The 



wo 95/28640 



PCTAJS95/04683 



-SB- 
size of the linking moiety is not critical, but one of 
convenience. The tags may form families of compoiinds, 
where all of the compounds are of a similar nature or may 
be combinations of different families, where the compounds 
5 may be aliphatic, alicyclic, aromatic, heterocyclic, or 
combinations thereof. Distinguishing features may be the 
number of repetitive units, such as methylene groups in an 
alkyl moiety, alkyleneoxy groups in a polyalkyleneoxy 
moiety, halo groups in a polyhalocompound, a- and/or )8- 
10 substituted ethylenes, where the substituents may involve 
alkyl groups, oxy, carboxy, amino, halo, or the like; 
isotopes; etc. 

Tag Analysis 

15 Tags may be removed from the bead using reductive, 
oxidative, thermolytic, hydrolytic, or photolytic 
conditions depending on the nature of the group F^; for 
example, by oxidation of a catechol ether with eerie 
ammonium nitrate or by photolysis of a nitrobenzyl ether 
20 or ester or amide, or by other methods, e.g. as shown in 
Table 1. 

Differentiation of tags can be achieved with physical 
differences, e.g. molecular weight of the tags or the 
chromatographic retention time using gas or liquid 
chromatography. Positional isomers may have different 
retention times. If positional isomers or steroisomers 
are inadequate for physical separation, then one could use 
varying numbers of substituents, e.gr. halogens (such as 
fluorines) , methyl groups, oxy groups, or'= other side 
chains in conjunction with differing numbers of units, 
e.gr. methylene groups or ethyl eneoxy groups, to provide 
the desired separation. Ratios of radioisotopes could be 
used, where the radioisotopes provide for differential 
emission; for example "C and ^H. The physical differences, 
particularly mass number, can provide information about 
choice and stage. 



25 



30 



35 
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Instead of "C/^H ratios, one could use combinations of non- 
radioactive isotopes, e.g. -CH„D„, where m is 0 and up to 
3 and n is 3 minus m. For example, by detecting the 
varying amounts of up to four different methyl groups 
5 using mass spectroscopy, one could define a large number 
of choices. 

When E-C' is H, the tags obtained upon release from the 
support have an active functionality for reaction with a 

10 labeling reagent which introduces a detectable tag 
component E. Conveniently, the functionality could be a 
double bond (particularly an activated double bond) , 
hydroxy, thio, amino, carboxy, etc. The tag would then be 
reacted with an excess of the labeling reagent to provide 

15 the product (E-CH) for analysis. In this way a wide 
variety of labeling reagents could be used as part of the 
identifying system, which may not be compatible with the 
synthetic strategy for the product of interest. Labeling 
reagents which may be used for detection include 

20 haloaromatics (e.g., perfluorobenzyl bromide) , fluorescers 
(e.g., dansyl chloride), radioisotopes, chemiluminescers, 
etc. 

While exemplaiy tags and reactions have been given, it 
25 should be understood that many other combinations could be 
employed. 

Depending on the chemical and physical nature of the tags, 
an appropriate method for separation is chosen, desirably 

30 one of various chromatographic procedures including gas 
chromatography (GC) , liquid chromatography (LC) 
particularly high-performance liquid chromatography 
(HPLC) , thin layer chromatography (TLC) , electrophoresis, 
etc. Instead of a chromatographic procedure, mass 

35 spectrometry may be employed for separation by mass 
number. Tags include: 
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for GC: chemically inert organic molecules having the 
same or different molecular weights including alkanes, 
alkenes, arenes, halocarbons, ethers, alcohols, silanes, 
thioethers, etc., particularly halogenated compounds, with 
5 or without other functionalities, for electron capture 
detection or mass spectroscopy detection (MS) with 
capillary GC separation, and for compound with elements 
not normally found in organic chemistry (e.g., Sn, Ge) for 
atom emission detection with GC capillary seperation; 

10 for LC, HPLC or TLC: see above for GC, conveniently 
linear ethers or hydrocarbons with substitution by 
radioisotopes or combinations of radioisotopes for 
radioassay detection or suitcible groups for fluorescence 
detection after separation; 

15 for electrophoresis: see above, particularly 

functionalized charged molecules, e.gr. cat ionic or 
anionic, particularly organic or inorganic acid groups, 
where the molecule may be further modified by having a 
detectable radioisotope or fluorescer for detection by 

20 electrophoresis ; 

for mass spectroscopy: see above, particularly different 
mass numbers due to different isotopes, different numbers 
of the same functionality or different functionalities, 
different members of a homologous series or combinations 

25 thereof . 

The separation of tags from one another may involve 
individual techniques or combinations of techniques, e.g. 
chromatography and electrophoresis; gas chromatography and 
30 mass spectroscopy; etc. 

The tags of the present invention will have a property 
which allows detection at very low levels, usually not 
greater than nanomole, preferably picomole or less, more 
35 preferably femtomole or 1 ss, in the presence of other 
compoxinds which may be present at significantly higher 
levels. For this reason, sp cific atomic substitutions 
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may be used to render the labels easily detectable. Such 
substitutions include: 

(a) substitution by electronegative elements, e.g. 
fluorine or chlorine, for electron capture detection in 

5 conjunction with capillary GC or negative ion mass 
spectroscopy detection; 

(b) substitution by an uncommon element (excluding C, H, 
and O) for atomic emission detection in conjunction with 
capillary GC; 

10 (c) substitution by several uncommon elements for atomic 
emission detection to determine the ratio between the 
elements; 

(d) siabstitution by a radioactive element, e.g. ^H, for 
detection by autoradiography or scintillation counting in 

15 conjunction with LC, TLC or electrophoresis; 

(e) siibstitution by a multiplicity of radioactive elements 
having differing emissions, e.g. and "C, for detection 
by autoradiography or scintillation counting to determine 
the ratio of the different radioactive elements. 

20 

For single -element substitution (a., b., d. above) a 
separad^le mixture of A tags whose simple presence or 
absence can be detected would encode up to 2^ different 
syntheses. For multiple-element substitution (see, c. and 
25 e * above ) a separable mixture of A tags each having B 
distinguishable states {e.g., different W"C ratios, 
different Si/Sn ratios) would be able to encode for up to 
different syntheses. 

30 A wide variety of isotopes exist, where the presence or 
ratio of isotopes may provide information as to stage and 
choice. The isotopes may be radioactive or non- 
radioactive. Isotopes of particular interest include 
deuterium, tritium, "C, ^^P, "^I, etc. 

35 

By employing mixtures of isotopically-modif ied compounds, 
one can cfreatly expand the information obtained from a 
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single tag compound which is only distinguished by the 
presence of isotopes. For example, one could prepare a 
mixture of ratios of hydrogen to deuterium, where the 
various ratios could differ by as little as 10% each. 

5 

By replacing hydrogens with another atom, such as 
fluorine, one would then have a varying mixture of 
hydrogens, deuteriums and fluorines, providing for a large 
number of different dif f erentiable tags. 

10 

Other groups that may be involved could be aromatic rings, 
which are differentially substituted, as to position and 
functionality. Thus, by having substituted benzene rings, 
where the position of the substitution and the nature of 

15 the substitution can be determined, one can provide for a 
plurality of molecules which can be distinguished and can 
provide for both stage and choice information. For 
example, if C were constant one could detect and 
discriminate through the substitution pattern on E when E 

20 is a polyhalogenated aromatic ring. 

There is also the possibility to use fluorescent tags. 
While fluorescent tags alone may not be sufficient to 
define a significant number of stages with a significant 
25 number of choices, as referred to above, by providing for 
means for separating the fluorescent tagging molecules 
based on variations in C or , one can individually 
detect the tags by their fluorescence. 

30 The mixture of tags associated with a particular bead may 
be detached and subject to an initial separation, where it 
is desirable to detect each of the tags separately. Once 
the group of tags has been separated, each of the tags may 
then be analyzed based on its particular fiinctionalities 

35 and distinctive properties. Various techniques which may 
be used to detect the particular tags include 
autoradiography or scintillation counting, electron 
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spectroscopy, infrared spectroscopy, ultraviolet 
spectroscopy, electron spin resonance spectroscopy, 
fluorescence, and the like, 

5 

Another composition may have at least 6 different markers 
being associated in a kit or in a common medium, each 
marker having a distinguishable moiety which is 
substantially chemically inert, differing from each other 
10 in molecular weight, said markers of the formula: 

(1) A-{A-(T)« or (T)„-A or A^-(T)„-a2} 

where A is a linking group which has a functionality for 
15 bonding to a solid support and a functionality for 
detachment from the solid support, which may be included 
in the functionality for bonding to the solid support; 

A is a distinguishing group, which allows for the 
20 distinguishing by a physical characteristic of each of the 
markers from each of the other markers by other than 
fluorescence, so as to provide a set of markers which 
allows for coding a multistep synthetic procedure, and 
includes any remaining functional group after detachment, 
25 which was previously associated with the linked group; 

A^ and A^ are portions of the distinguishing group, which 
together define a distinguishing group and when joined 
together come within the definition of A; 

30 

T is a detectable group, which when attached to the 
distinguishing group allows for the low level detection of 
the marker, where the detectable group may be present on 
. the marker in the kit or may be added later to the 
35 distinguishing group and if attached to the linking group, 
includes any remaining functional group after detachment, 
which was previously associated with the linking group; 
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and a is 0 or 1, indicating that the detectable group may 
or may not be present; 

(2) ss-{A'-{A-(T)„ or {T)«-A or A^- (T)„-a2} ) ^ 

5 

where all of the symbols have been defined previously, 
except as follows: ss is a solid support; A' is a linking 
group covalently bonded to ss; and ^ is an integer as to 
each solid support and is at least six and usually not 
10 more than about 30; 

(3) A"-{A"-(T)„ or (T")«-A or A'^-(T)„-A2} 

where all of the symbols have been defined previously, 
15 except as follows: A" is hydrogen or the residue of the 
linking group after photolytic cleavage, elimination or 
other chemical reaction which results in detachment from 
the solid support; A" or A'^ is A or A^ or a modified A or 
A^, respectively, as a result of the detachment of the 
20 marker from the solid support; T" is T or a modified T as 
a result of the detachment of the marker from the solid 
support ; 

(4) T„-A' "-{A"- {T)« or (T")«-A or A'^-(T)„-A2} 

25 

where all of the symbols have been defined previously 
andA' " is a bond or the remaining portion of the linker 
group after attachment to T; with the additional 
limitation that only one ck is 1. 

30 

Assays 

To determine the characteristic of , interest of the 
product, a wide variety of assays and techniques may be 
employed. 

35 

Frequently, in screening the beads, one will use either 
single beads or mixtures of beads and determine whether 
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the bead or mixtures show activity. Thus, the mixtures 
may involve 10, 100, 1000 or more beads. In this way, 
large groups of compounds may be rapidly screened and 
segregated into smaller groups of compounds. 

5 

One technique is where one is interested in binding to a 
particular biomolecule such as a receptor. The receptor 
may be a single molecule, a molecule associated with a 
microsome or cell, or the like. Where agonist activity is 

10 of interest, one may wish to use an intact organism or 
cell, where the response to the binding of the subject 
product may be measured. In some instances, it may be 
desirable to detach the product from the bead, 
particularly where physiological activity by transduction 

15 of a signal is of interest. Various devices are available 
for detecting cellular response , such as a 
microphysiometer, available from Molecular Devices, 
Redwood City, CA. Where binding is of interest, one may 
use a labeled receptor, where the label is a fluorescer, 

20 enzyme, radioisotope, or the like, where one can detect 
the binding of the receptor to the compound on the bead. 
Alternatively, one may provide for an antibody to the 
receptor, where the antibody is labeled, which may allow 
for amplification of the signal and avoid changing the 

25 receptor of interest, which might affect its binding to 
the product of interest. Binding may also be determined 
by displacement of a ligand bound to the receptor, where 
the ligand is labeled with a detectable label. 

30 In some instances, one may be able to carry out a 
two- stage screen, whereby one first uses binding as an 
initial screen, followed by biological activity with a 
viable cell in a second screen. By employing recombinant 
techniques, one can greatly vary the genetic capability of 

35 cells. One can then produce exogenous genes or exogenous 
transcriptional regulatory sequences, so that binding to 
a surface membrane protein will result in an observable 
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signal , e.g. an intracellular signal . For example , one 
may introduce a leuco dye into the cell, where an enzyme 
which transforms the leuco dye to a colored product, 
particularly a fluorescent product, becomes expressed upon 
5 appropriate binding to a surface membrane, e.g. 
jS-galactosidase and digalactosidylf luorescein. In this 
manner, by associating a particular cell or cells with a 
particular particle, the fluorescent nature of the cell 
may be determined using a FACS, so that particles carrying 

10 active compounds may be ioentified. Various techniques 
may be employed to ensure chat the particle remains bound 
to the cell, even where the product is released from the 
particle. For example, one may use antibodies on the 
particle to a surface membrane protein, one may link 

15 avidin to the surface of the cell and have biotin present 
on the particle, etc. 

Assays may be performed stagewise using individual 
particles or groups of particles or combinations thereof. 

20 For exan^le, after carrying out the combinatorial 
syntheses, groups of about 50 to 10,000 particles may be 
segregated in separate vessels. In each vessel, as to 
each particle a portion of the product bound to the 
particle is released. The fractional release may be as a 

25 result of differential linking of the product to the 
particle or using a limited amount of a reagent, condition 
or the like, so that the average number of product 
molecules released per particle is less than the total 
number of product molecules per particle. One woui'd then 

30 have a mixture of products in a small volume. The mixture 
could then be used in an assay for binding, where the 
binding event could be inhibition of a known binding 
ligand binding to a receptor, activation or inhibition of 
a metabolic process of a cell, or the like. Various assay 

35 conditions may be used for the detection of binding 
activity as will be described subsequently- Once a group 
is shown to be active, the individual particles may then 
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be screened, by the same or a different assay. One could 
of course, have a three- or four-stage procedure, where 
large groups are divided up into smaller groups, etc. and 
finally single particles are screened. In each case, 
5 portions of the products on the particles would be 
released and the resulting mixture used in an appropriate 
assay. The assays could be the same or different, the 
more sophisticated and time consuming assays being used in 
the later or last stage. 

10 

One may also provide for spatial arrays, where the 
particles may be distributed over a honeycomb plate, with 
each well in the honeycomb having 0 or 1 particle . 

15 The subject methodology may be used to find chemicals with 
catalytic properties, such as hydrolytic activity, e.g. 
esterase activity. For this purpose one might embed beads 
in a semisolid matrix surrounded by diffusible test 
substrates. If the catalytic activity can be detected 

20 locally by processes that do not disturb the matrix, for 
example, by changes in the absorption of light or by 
detection of fluorescence due to a cleaved substrate, the 
beads in the zone of catalytic activity can be isolated 
cuid their labels decoded. 

25 

Instead of catalytic activity, compounds with inhibitory 
or activating activity can be developed. Compounds may be 
sought that inhibit or activate an enzyme or block a 
binding reaction. To detect beads that inhibit an enzyme, 

30 which beads have an attached product with this desirable 
property, it is advantageous to be able to release the 
products from the beads, enabling them to diffuse into a 
semisolid matrix or onto a filter where this inhibition, 
activation or blocking can be observed. The beads that 

35 form a visualized or otherwise detectable zone of 
inhibition, activation or blocking can then be picked and 
the tags decoded. In this case it is necessary that a 
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portion of the synthesized products be attached to the 
beads by cleavable linkages, preferably a photolabile 
linkage, while a portion of the tags remain attached to 
the bead, releasable after picking by a different means 
5 than before. 

A dialysis membrane may be employed where a layer of beads 
is separated from a layer of radiolabeled ligand/receptor 
pair. The bead layer could be irradiated with ultraviolet 

10 light and the product released from the bead would diffuse 
to the pair layer, where the radiolabeled ligand would be 
released in proportion to the affinity of the compound for 
the receptor. The radiolabeled ligand would diffuse back 
to the layer of beads. Since the radiolabel would be 

15 proximal to the bead, beads associated with radioemission 
would be analyzed. 

Of particular interest is finding products that have 
biological activity. In some applications it is desirable 

20 to find a product that has an effect on living cells, such 
as inhibition of microbial growth, inhibition of viral 
growth, inhibition of gene expression or activation of 
gene expression. Screening of the compounds on the beads 
can be readily achieved, for example, by embedding the 

25 beads in a semisolid medium and the library of product 
molecules released from the beads (while the beads are 
retained) enabling the compounds to diffuse into the 
surrounding medium. The effects, such as plaques within 
a bacterial lawn, can be observed. Zones of growth 

30 inhibition or growth activation or effects on gene 
expression can then be visualized and the beads at the 
center of the zone picked and analyzed. 

One assay scheme will involve gels where the molecule or 
35 system, e.g. cell, to be acted upon may be embedded 
substantially homogeneously in the gel. Various gelling 
agents may be used such as polyacrylamide, agarose, 



wo 95/28640 



PCT/US95/04683 



-64- 

gelatin, etc. The particles may then be spread over the 
gel so as to have sufficient separation between the 
particles to allow for individual detection. If the 
desired product is to have hydrolytic activity, a 
5 sxibstrate is present in the gel which would provide a 
fluorescent product. One would then screen the gel for 
fluorescence and mechanically select the particles 
associated with the fluorescent signal . 

10 One could have cells embedded in the gel, in effect 
creating a cellular lawn. The particles would be spread 
out as indicated above. Of course, one could place a grid 
over the gel defining areas of one or no particle. If 
cytotoxicity were the criterion, one could release the 

15 product, incubate for a sufficient time, followed by 
spreading a vital dye over the gel. Those cells which 
absorbed the dye or did not absorb the dye could then be 
distinguished . 

20 As indicated above, cells can be genetically engineered so 
as to indicate when a signal has been transduced. There 
are many receptors for which the genes are known whose 
expression is activated. By inserting an exogenous gene 
into a site where the gene is under the transcriptional 

25 control of the promoter responsive to such receptor, an 
enzyme can be produced which provides a detectable signal, 
e.g. a fluorescent signal. The particle associated with 
the fluorescent cell(s) may then be analyzed for its 
reaction history. 

30 

Libraries and Kits 

For convenience, libraries and/or kits may be provided. 
The libraries would comprise the particles to which a 
library of products and tags have been added so as to 
35 allow for screening of the products bound to the bead or 
the libraries would comprise the products removed from 
the bead and grouped singly or in a set of 10 to 100 to 
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1000 members for screening. The kits would provide 
various reagents for use as tags in carrying out the 
library syntheses. The kits will usually have at least 4, 
usually 5, different compounds in separate containers, 
5 more usually at least 10, and may comprise at least 10^ 
different separated organic compounds, usually not more 
than about 10^, more usually not more than about 36 
different compounds. For binary determinations, the mode 
of detection will usually be common to the compounds 

10 associated with the analysis, so that there may be a 
common chromophore, a common atom for detection, etc. 
Where each of the identifiers is pre -prepared, each will 
be characterized by having a distinguishable composition 
encoding choice and stage which can be determined by a 

15 physical measurement and including groups or all of the 
compounds sharing at least one common functionality. 

Alternatively, the kit can provide reactants which can be 
combined to provide the various identifiers. In this 

20 situation, the kit will comprise a plurality of separated 
first functional, frequently bifunctional, ' organic 
compounds, usually four or more, generally one for each 
stage of the synthesis, where the functional organic 
compounds share the same f\inctionality and are 

25 distinguishable as to at least one determinable 
characteristic. In addition, one would have at least one, 
usually at least two, second organic compounds capable of 
reacting with a functionality of the functional organic 
compounds and capable of forming mixtures which are 

30 distinguishable as to the amount of each of said second 
orgcuiic compounds. For example, one could have a glycol, 
amino acid, or a glycolic acid, where the various 
bifunctional compounds are distinguished by the number of 
fluorine or chlorine atoms present, to define stage, and 

3 5 have an iodomethane , where one iodomethane has no 
radioisotope, another has "C and another has one or more 
^H. By using two or more of the iodomethanes , one could 
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provide a variety of mixtures which could be determined by 
their radioemissions . Alternatively, one could have a 
plurality of second organic compounds, which could be used 
in a binary code. 

5 

As indicated previously one could react the tags after 
release with a molecule which allows for detection- In 
this way the tags could be quite simple, having the same 
functionality for linking to the particle as to the 

10 detectable moiety. For example, by being linked to a 
hydroxycarboxyl group, a hydroxyl group would be released, 
which could then be esterified or etherified with the 
molecule which allows for detection. For example, by 
using combinations of fluoro- and chloroalkyl groups, in 

15 the binary mode, the number of fluoro and/or chloro groups 
could determine choice, while the number of carbon atoms 
would indicate stage. 

Groups of compounds of particular interest include 
20 distinguishing groups such as C joined to a substituted 
ortho-nitrobenzyloxy group, indanyloxy or fluorenyloxy 
group, or other group which allows for photolytic or other 
selective cleavage. The distinguishing group may be an 
alkylene group of from 2 to 20 carbon atoms, 
25 polyalkyleneoxy, particularly alkyleneoxy of from 2 to 3 
carbon atoms, cycloalkyl group of from 4 to 8 carbon 
atoms, haloalkyl group, particularly fluoroalkyl of from 
2 to 20 carbon atoms, one or more aromatic rings and the 
like, where the distinguishing group provides for the 
3 0 discrimination between the various groups, by having 
different numbers of units and/or siibstituents. 

Individual particles or a plurality of particles could be 
provided as articles of commerce, particularly where the 
35 particle (s) have shown a characteristic of interest. 
Based on the associated tags, the reaction history may be 
decoded. The product may then be produced in a large 
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synthesis. Where the reaction history unequivocally 
defines the strxicture, the same or analogous reaction 
series may be used to produce the product in a large 
batch. Where the reaction history does not unambiguously 
5 define the structure, one would repeat the reaction 
history in a large batch and use the resulting product for 
structural analysis. In some instances it may be found 
that the reaction series of the combinatorial chemistry 
may not be the preferred way to produce the product in 
10 large amounts. 

An embodiment of this invention is a kit comprising a 
plurality of separated organic compounds, each of the 
compounds characterized by having a distinguishable 
15 composition, encoding at least one bit of different 
information which can be determined by a physical 
measurement, and sharing at least one common 
functionality. A preferred embodiment is a kit comprising 
at least 4 different functional organic compoiinds. 

20 

More preferred is a kit wherein said functional organic 
compounds are of the formula: 

F1.f2_C-E-C' I 

and the formula: 
25 F^-F^- (C(E-£' )a)b (la) 

where F^-F^ is a linker which allows for attachment to and 
detachment from a solid particle; and 

C-E-C' is a tag member which can be determined by a 
physical measurement, especially wherein said fiinctional 
30 organic compounds differ by the number of methylene groups 
and/or halogens, nitrogens or sulfurs present. 

Also preferred is a kit wherein the C-E-C' portion is 
removed photochemically or a kit wherein the C-E-C' 
35 portion * is removed oxidatively, hydrolytically, 
thermolytically, or reductively. 
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In one embodiment the invention is a composition 
comprising at least S different components, each component 
having a distinguishable moiety. The components may be 
characterized by each moiety being substantially 
5 chemically stable or inert and having an identifiable 
characteristic different from each of the other moieties. 
Each moiety is joined to a linking group having an active 
functionality capable of forming a covalent bond through 
a linking group to individur.lly separable solid surfaces, 

10 or joined to a group which is detectable at less than 1 
nanomole, with a proviso that when the moieties are joined 
to the linking group, the components are physically 
segregated. Preferably, the solid supports are beads. 
In one embodiment each component comprises molecules of 

15 different compounds bound to individual separable solid 
surfaces, wherein the molecules on the solid surfaces . 
Preferably, the moieties of the invention define an 
homologous series and/or a series of substitutions on a 
core molecule. 

20 

The invention herein is also directed to a compound 
library comprising at least one hundred unique solid 
supports. In this compound library each solid support has 
(1) an individual compound bound to the solid support as 

25 a major compound bound to the support; and (2) a plurality 
of tags e.g. tags incapable of being sequenced, where the 
tags are individual tag molecules which are physically 
distinguishable in being physically separable and are 
substituted so as to be detectable at less than about a 

30 nanomole or have a functional group for bonding to a 
substituent which is detectable at less than about at 
nanomole. Preferably, in the compound library each solid 
support has at least about 6 tags. In smother embodiment, 
in the compoxind library the tags define a binary code 

35 encoding the synthetic protocol used for the synthesizing 
of the compound on the solid support. 
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This invention also provides a method for determining a 
synthetic protocol encoded by separable physically 
different tags in a series and defining a binary code. In 
this method at least two tags are employed to define each 
5 stage of the synthetic protocol, there being at least six 
tags. The step of the method comprises separating tags by- 
means of their physical differences and detecting the 
tags. The synthetic protocol is defined a binary code of 
different tags. 

10 

Compounds of this invention may be useful as analgesics 
and/or for the treatment of inflammatory disease, 
especially in the case of the azatricyclics acting as 
antagonists of the neurokinin l/bradykinin receptor. 
15 Members of the benzediazopine library may be useful as a 
muscle relaxant and/or tranquilizer and/or as a sedative. 
Members of the 23.5 Million Mixed Amide Library (Example 
6) may be of use in the treatment of hypertension or 
Raynaud's syndrome by acting as endothelin antagonists. 

20 

EXAMPLE 1 

PEPTIDE LIBRARY 
In order to encode up to 10' different syntheses, one could 
prepare 30 different identifiers which carxy individual 

25 tags capable of being separated one from another by 
capillary GC. For encoding a smaller number of syntheses, 
fewer identifiers would be used. The tags would normally 
be prepared from commercially-available chemicals as 
evidenced by the following illustration. 

30 w-Hydroxyalkenes-l/ where the number of methylene groups 
would vary from 1 to 5, would be reacted with an 
iodoperf luoroalkane, where the number of CF^ groups would 
be 3, 4, 6, 8, 10, and 12. By employing a free-radical 
catalyst, the iodoperf luorocarbon would add to the double 

35 bond, where the iodo group could then be reduced with 
hydrogen and a catalyst or a tin hydride. In this manner, 
30 different tags could be prepared. The chemical 
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procedure is described by Haszeldine and Steele, J. Chem. 
Soc. (London), 1199 (1953); Brace, J. Fluor. Chem. , 20, 
313 (1982) . The highly fluorinated tags can be easily 
detected by electron capture, have different GC retention 
5 times, so that they are readily separated by capillary GC, 
are chemically inert due to their fluorinated, hydrocarbon 
structure and each bears a single hydroxyl functional 
group for direct or indirect attachment to particles . 

10 Before attachment to compound precursors, the tags 
(referred to as T1-T30) would be activated in a way which 
is appropriate for the chemical intermediates to be used 
in the combinatorial synthesis. By appropriate it is 
intended that a functionality would be added which allows 

15 for ready attachment by a chemical bond to a compound 
precursor or to the bead matrix itself. The activation 
process would be applied to each of the 30 different tags 
and allow these tags to be chemically bound, either 
directly or indirectly, to intermediates in the 

20 combinatorial compound synthesis. For example, a carboxy 
derivative could be used for coupling and upon activation 
the resulting carboxy group would bond to the particle. 

In the case of a combinatorial synthesis of a peptidic 
25 compound or other structure made of amide -linked organic 
fragments, the encoding process could consist of addition 
of a carboxylic acid-equipped linker. For example, the 
tag would be coupled to the tgrt. -butyl ester of o-nitro- 
E-carboxybenzyl bromide in the presence of sodium hydride. 
30 The ester would then be hydrolyzed in dilute 
trif luoroacetic acid. 

Activated identifiers would be coupled to intermediates at 
each stage in the combinatorial compound synthesis. The 
35 ortho-nitrobenzyl ether part of the activated identifiers 
is used to allow photochemical detachment of the tags 
after completing the combinatorial synthesis and selecting 



wo 95/28640 PCTAJS95/04683 

-71- 

the most desirable compounds. The detached tags would 
then be decoded using capillary GC with electron capture 
detection to yield a history of the synthetic stages used 
to prepare the compound selected. 

5 

While there is an almost unlimited set of chemical stages 
and methods which could be used to prepare combinatorial 
libraries of compounds, we will use coupling of Of-amino 
acids to make a combinatorial library of peptides as an 

10 example of an application of the encoding methodology. In 
this example, we will describe preparation of a library of 
pentapeptides having all combinations of 16 different 
amino acids at each of the five residue positions. Such 
a library would contain 16^ members. To uniquely encode 

15 all members of this library, 20 detachable tags (T1-T20) 
as described above would be required. 

To prepare the encoded library, we would begin with a 
large number {>10^) of polymer beads of the type used for 

20 Merrifield solid phase synthesis and functionalized by 
free amino groups. We would divide the beads into 16 
equal portions and place a portion in each of 16 different 
reaction vessels (one vessel for each different a- amino 
acid to be added) . We would then add a small portion 

25 (e.g., 1 mol%) of identifiers to each of the amino acid 
derivatives (e.g., Fmoc amino acids) to be coupled in the 
first stage of the combinatorial synthesis. The specific 
combination of the tags incorporated into the identifiers 
added would represent a simple binary code which 

30 identifies the amino acid used in the first stage of 
synthesis. The 16 amino acids added would be indicated by 
numbers 1-16 and any such number could be represented 
chemically by combinations of the first four tags (T1-T4) . 
In tables 2 suid 3, a typical encoding scheme is shown in 

35 which the presence or absence of a tag is indicated by a 
1 or a 0, respectively. The letter T may represent 
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either the the tag or the identifier incorporating that 
tag. 

5 Table 2 . A typical encoding scheme . 



15. 



Amino Acid added in first 
stage 


T4 


T3 


T2 


Tl 


Number 1 (e.g., glycine 


0 


0 


0 


0 


Number 2 (e.g., alanine) 


0 


0 


0 


1 


Number 3 (e.g., valine) 


0 


0 


1 


0 


Number 4 (e.g., serine) 


0 


0 


1 


1 


Number 5 (e.g., threonine) 


0 


1 


. 0 


0 






















Number 16 (e.g., tryptophan) 


1 


1 


1 


1 



20 We would then carry out a standard dicyclohexyl- 
carbodiimide (DCC) peptide coupling in each of the 16 
vessels using the Fmoc amino acids admixed with small 
amounts of the encoding activated identifiers as indicated 
above. During the couplings, the amino acids as well as 

25 small amounts (e.g., 1%) of the identifiers would become 
chemically bound to intermediates attached to the beads. 

Next the beads would be thoroughly mixed and again 
separated into 16 portions. Each portion would again be 
30 placed in a different reaction vessel. A second amino 
acid admixed with appropriate new activated identifiers 
(T5-T8) would be added to each vessel and DCC coupling 
would be carried out as before. The particular mixture of 
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the incorporated tags (T5-T8) would again represent a 
simple binary code for the amino acid added, in this, the 
second stage of the combinatorial synthesis - 

5 Table 3 . A\ typical encoding scheme . 



15 



Amino Acid added in second 
stage 


T8 


T7 


T6 


T5 


Number 1 {e.g. , glycine 


0 


0 


0 


0 


Number 2 (e.g., alanine) 


0 


0 


0 


1 


Number 3 {e.g., valine) 


0 


0 


1 


0 


Number 4 {e.g., serine) 


0 


0 


1 


1 


Number 5 {e.g., threonine) 


0 


1 


0 


0 






















Number 16 {e.g., tryptophan) 


1 


1 


1 


1 



After the 16 couplings of stage 2 are complete, the beads 
would be again mixed emd then divided into 16 new portions 

20 for the third stage of the synthesis. For the third 
stage, T9-T12 would be used to encode the third amino acid 
boxmd to the beads using the same scheme used for stages 
1 and 2. After the third couplings, the procedure would 
be repeated two more times using the fourth amino acids 

25 with T13-T16 and the fifth amino acids with T17-T20 to 
give the entire library of 1,048,576 different peptides 
bound to beads. 

Although the above beads .would be visually 
30 indistinguishable, any bead may be chosen {e.g., by 
selecting based on the interesting chemical or biological 
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properties of its bound peptide or other target molecule) 
and its synthetic history may be learned by detaching ,and 
decoding the associated tags. 

5 The precise method used to detach tags will depend upon 
the particular linker used to chemically bind it to 
intermediates in the combinatorial synthesis of the target 
compound. In the example above, the or tho-nitrobenzyl 
carbonate linkages, which are known to be unstable to 

10 -300 nm light (Ohtsuka, et al., J. Am. Chem. Soc, 100, 
8210 [1978] ) , would be cleaved by photochemical 
irradiation of the beads. The tags would then diffuse 
from the beads into free solution which would be injected 
into a capillary gas chromatograph (GO equipped with a 

15 sensitive electron capture detector. Since the order in 
which the tags (T1-T20) emerged from the GC and their 
retention times under standard conditions were previously 
determined, the presence or absence of any of T1-T20 would 
be directly determined by the presence or absence of their 

20 peaks in the GC chromatogram . If 1 and 0 represent the 
presence and absence respectively of peaks corresponding 
to T1-T20, then the chromatogram can be taken as a 
20 -digit binary number which can uniquely represent each 
possible synthesis leading to each member of the peptide 

25 library. The use of halocarbon tags which are safe, 
economical ^ and detectable at subpicomole levels by 
electron capture detection makes this capillary GC method 
a particularly convenient encoding scheme for the purpose. 
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As an example of using the encoding scheme for the 
pentapeptide library above, a particular bead is 
irradiated with light to detach the tags, the solubilized 
labels injected into a capillary GC and the following 
5 chromatogram obtained ("Peak" line) : 



Label 20 19 18 17 16 15 U 13 12 11 10 9 8 7 6 5 4 3 2 1 GC Inject 

10 Peak 1 I I I I II I I I 

Binary 1111 0100 OGII 0001 0010 

Stage 5 4 2 1 

15 

AA Tryptqahan Threonine Serine Alanine Valine 



The "Label" line diagrams the GC chromatogram where T20-T1 
20 peaks (|) are to be found (note the injection is given on 
the right and the chromatogram reads from right to left) . 
The "Peak" line represents the presence of labels (T20-T1) 
as peaks in the chromatogram. The "Binary" line gives 
presence (1) or absence (0) of peaks as a binajry number.- 
25 The "Stage" line breaks up the binary number into the five 
different parts encoding ths five different stages in the 
synthesis. Finally, the ''AA" line gives the identity of 
the amino acid which was added in each stage and was given 
by the binary code in the "Binary" line above. 
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EXAMPLK 2 

RADIO-LABELED TAGS . 
In the next illustration, the tags employed are 
monomethylethers of linear alkyl-a, w-diols . The diol 
5 would have N + 2 carbon atoms, where N designates the 
stage. The methyl group would be a radiolabeled reagent 
which would have any of a variety of ^H/"C ratios from 1/1 
to m/1, where m is the number of choices. The double 
radiolabel allows for accurate quantitation of the tritium 

10 present in the tag. By having 10 different alkylene 
groups and 10 different radioactive label ratios, 10" 
unique ten-member sets of tags are generated. Tags would 
be attached by first reacting them with activating agents, 
e.g. phosgene to form a chlorof ormate , followed by 

15 reaction with the F^-F^ component. In this case, F^-P^ is 
the o-nitro-p-carboxy-benzyl alcohol protected as the t- 
butyl ester. Each time a synthetic stage is carried out, 
the de-esterif ied identifier is added directly to the 
bead, which has covalently bonded amine or hydroxy 1 

20 groups, to form amides or esters with the acid activated 
using standard chemistry, e.g., carbodiimide coupling 
methodology. At the end of the secjuential synthesis, the 
beads are then screened with a variety of receptors or 
enzymes to determine a particular characteristic. The 

25 beads demonstrating the characteristic may then be 
isolated, the tags detached and separated by HPLC to give 
a series of glycol monomethyl ethers which may then be 
analyzed for radioactivity by standard radioisotope 
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identif ication methods. For example, if the first and 
second tags to elute from the HPLC column had W^*C ratios 
of 5:1 and 7:1 respectively, then the product which showed 
activity had been synthesized by reagent number 5 in 
5 stage 1 and reagent number 7 in stage 2 . 

EXAMPLE 3 

24 01 Peipi-idR Library 
The identifiers employed were 2-nitro-4-carboxybenzyli 

10 0-aryl substituted co-hydroxyalkyl carbonate, where alkyl 
was of from three to 12 carbon atoms and aryl was (A) 
pentachlorophenyl, (B) 2,4,6-trichlorophenyl, or (C) 2,6- 
dichloro-4-fluorophenyl. The tags are designated as NAr, 
wherein N is the number of methylene groups minus two and 

15 Ar is the aryl group. Thus, tag 2A has a butylene group 
bonded to the pentachlorophenyl through oxygen. The 
subject tags can be easily detected using electron capture 
gas chromatography at about 100 fmol. 

20 In the subject analysis, the tagging molecules are 
arranged in their GC elution order. Thus the tag which is 
retained the longest on the GC column is designated Tl and 
is associated with the least significant bit in the binary 
synthesis code number, the next longest retained tag is 

25 called T2 representing the next least significant binary 
bit, and so on. Using an 0.2mM x 20M methy.lsilicone 
capillary GC column, eighteen well-resolved tags were 
obtained where Tl through T18 corresponded to lOA, 9A, 8A, 
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7A, €A, 5A, 4A, 3A, 6B, 2A, 5B, lA, 4B, 3B, 2B, IB, 2C, 
and IC, respectively. 

An encoded combinatorial library of 2401 peptides was 
5 prepared. This library had the amino acid sequence N- 
XXXXEEDLGGGG-bead, where the variable X residues were D, 
E, I, K, L, Q, or S (single letter code) . The 4 glycines 
served as a spacer between the encoded amino acid sequence 
and the bead. The combinatorial library included the 

10 sequence H2N-KLISEEDL, part of the 10 amino acid epitope 
which is known to be bound by 9E10, a monoclonal antibody 
directed against the human C-myc gene product. For 
encoding this library, three binary bits were sufficient 
to represent the seven alternative reagents for each 

15 stage. The code was as follows: 001 = S; 010 = I; Oil = 
K; 100 = L; 101 = Q; 110 = E; 111 = D. 



The library was synthesized by first preparing the 
constant segment of the library HaNEEDLGGGG-bead on 1.5 g 

20 of 50-90^ polystyrene synthesis beads functionalized with 
1.1 meq/g of aminomethyl groups using standard solid phase 
methods based on t. -butyl side-chain protection and Fmoc 
main chain protection (Stewart and Young, "Solid Phase 
Peptide Synthesis", 2nd edition, Pierce Chemical Co., 

25 1984) . After deprotecting the Fmoc groups with 
diethylamine, the beads were divided into seven 200 mg 
fractions and each fraction placed in a different 
Merrifield synthesis vessel mounted on a single wrist- 
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action shaker. The beads in the seven vessels were 
processed independently as follows (see Table 3-1) . The 
letter T in this example refers to the tag or to the 
identifier incorporating that tag. 

5 

TABLE 3-1 



Ves 
sel 
No. 


Step 1 


Step 2 


Step 3 


Step 4 


1 


1%T1 


Die, wash 


Fmoc(tBu)S, Anh. 


Wash 


2 


1%T2 


It 


FmocI, Anh. 


ri 


3 


1%T1,T2 


II 


Fmoc(Boc)K, Anh. 


11 


4 


1%T3 


n 


FmocL, Anh. 


TI 


5 


1%T1,T3 


ti 


Fmoc (trityl) Q/ 
Anh, 


11 


6 


1%T2,T3 


n 


Fmoc (t-butyl)E, 
Anh. 


n 


7 


1%T1 , T2 , T3 


II 


Fmoc(tBu)D, Anh. 


II 



In accordance with the above procedure a sufficient amount 
20 of the identifiers listed in step 1 were attached via 
their carboxylic acids using diisopropylcarbodiimide to 
tag about 1% of the free amino groups on each bead in the 
corresponding vessel. The remaining free amino groups on 
each bead were then coupled in step 3 to N-protected amino 
25 acid anhydrides. After washing with methylene chloride, 
isopropanol, and N,N-dimethylformamide, the beads from the 
seven vessels were combined and thoroughly mixed. At this 
point the library had seven members. 
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After Fmoc deprotection (diethylamine) , the beads were 
again divided into seven vessels and processed as before 
except that in place of the identifiers used previously, 
identifiers representing the second stage (T4-6) were 
5 used. By repeating the procedure two more times, using 
identifiers T7-9 and then TlO-12 analogously, the entire 
uniquely encoded library of 7^=2401 different peptides was 
prepared using only 12 identifiers. 

10 To read the synthesis code from a single selected bead, 
the bead was first washed four times in a small centrifuge 
tube with 100 portions of DMF, and then resuspended in 
1 fill of DMF in a Pyrex capillary tube. After 2 hrs of 
^ photolysis with a Rayonet 350 nm light source, the tags 

15 released from the bound identifiers were silylated using 
about 0.1 fill bis-trimethylsilylacetamide and the solution 
injected into a Hewlett Packard capillary gas 
chromatograph equipped with an 0.2mM x 20M methylsilicone 
fused silica capillary column and an electron capture 

20 detector- The binary synthesis code of the selected bead 
was directly determined from the chromatogram of the tags 
which resulted. 
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EXAMPLE 4 

Benzodiazepine Library 
A combinatorial benzodiazepine library comprising 
compounds of the formula VIII 

5 



CI 




VIII 



wherein: 

R is CHj, CH(CH3)2, CH2CO2H, (CH2)4NH2, CHjCgH^OH, or CHjCg 
and 

15 is H, CH3, CjHs, CHaCHsCHa, or CHjCgHs 

is constructed per the following scheme. 
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STEP 



^ (pT^n 




IV 0 HNFmoc 

© = POLYSTYREHE RESIN 



STEP D 



1) TAGS Eg. 



2) 20X PIPERIDINE IN DHF 

R 

FmocN CO 
H 1 



3) 




3) 5 ACOH/DMF 

60 "C 
STEP E 
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STEP E 




STEP F 



1) LITHIATED 5(PHEHYLMETHYL)-2- 
OXAZOLIDINOHE 
THF, 



0 

-78 C 



2) R X. DMF 

X=BROMINE OR lODIME 

3) TFAiHgOiDDIETHYLSULFIDE 

95:5:10 




STEP G 



hv (350 ni) 
DMF 
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STEP G 

▼ 




0 
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The benzodiazepines VIII are constructed on polystyrene 
beads similarly to the method of Bunin and Ellman (JACS, 
114, 10997-10998 [1992] ) except that a photolabile linker 
5 is incorporated between the bead and the benzodiazepine 
(see steps A, B, and C) , thus allowing the benzodiazepine 
to be removed in step G non-hydrolytically by exposure to 
U,V. light (350 nm in DMF for 10 minutes to 12 hr) . 
Additionally, binary codes are introduced in steps D and 
10 E which allow for a precise determination of the reaction 
sequence used to introduce each of the 6 R's cind 5 R^'s. 
After removal of the tags according to step H and analysis 
by electron capture detection following GC separation, the 
nature of the individual R and R^ groups is determined, 

15 

Steps D, and F essentially follow the procedure of 

Bunin and Ellman, but also include the incorporation of 
identifiers IXa-c in step D and IXd-f in Step E. The 
identifiers are all represented by Formula IX, 

20 




IX 
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wherein : 

IXa indicates n=6; 
IXb indicates n=5; 
IXc indicates n=4; 
5 IXd indicates n=3; 

IXg indicates n=2; and 
IXf indicates n=l. 

I 

The codes for each of R and are as follows: 
10 Table 4-1 



IX 


R 


a 


CH3 


b 


CH (013)2 


a,b 


CH2CO2H 


c 


(CH2)4NH2 


a,c 


CH2-C6H4-4-OH 


b,c 


CH2^6^5 


IX 


Si 


d 


H 


e 


CH3 


d,e 


C2H5 


f 


CH2CH=CH2 


d,f 


CH2C6H5 
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Step A 

To a solution of I (1 equiv) in toluene (cone. = 0.5 M) is 
added the Fmoc protected 2-amino-5-chloro-4 ' -hydroxy- 
benzophenone {1.3 eq) and diethylazaodicarboxylate (1.3 eq) 
5 and triphenylphosphine (1.3 eq) . The mixture is stirred 
at room temperature for 24 hr. The solvent is removed in 
vacuo and the residue triturated with ether and filtered 
and the solvent again removed in vacuo . The resultant 
product II is purified by chromatography on silica gel. 

10 

Step B 

To a solution of II in DCM (0.2 M) stirring at r.t. is 
added TFA (3 equiv.) and the solution is allowed to stir 
for 12 hr. The solution is evaporated to dryness in vacuo 
15 and the residue dissolved in DCM, washed once with brine 
and dried (Na2S04) . Filtration and evaporation of the 
solvent affords III. 

Step C 

20 1% DVB (divinylbenzene) cross-linked polystyrene beads 
(50^) functionalized with aminomethyl groups (1.1 mEq/g) 
are suspended in DMF in a peptide reaction vessel 
(Merrifield vessel) . Ill (2 equiv) and HOBt (3 equiv) in 
DMF are added and the vessel shaken for 10 min. DIG (3 eq) 

25 is added and the vessel is shaken \intil a negative 
Ninhydrin test indicates completion of the reaction after 
12 hr. 
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The DMF is removed and the resin washed with additional 
DMF (x5) and DCM (x5) before drying in vacuo . 

Step D 

5 The dry resin is divided into 6 reaction vessels and is 
suspended in DCM, The appropriate combinations of 
identifiers IX^.^ (see Table 4-1) are added to the flasks 
and shaken for 1 hr. The Rh(TFA)2 catalyst (1 mol%) is 
added to each flask and sihaken for an additional 2 hr. 

10 The flasks are drained and the resin washed with DCM (x5) . 
The resin is then treated with a solution of TFA in DCM 
(0.01 M) and shaken for 30 min. and then washed again with 
DCM (x3) followed by DMF (x2) , The resin is treated with 
a 20% solution of piperidine in DMF and shaken for 30 min. 

15 and is then washed with DMF (x3) and DCM (x3) . 

To each flask is added the appropriate Fmoc protected 
amino acylfluoride (3 equiv) (when required side-chain 
functional groups are protected as tert -butyl ester (Asp) , 

20 tert -butyl ether (Tyr) or tert -butvloxvcarbonyl (Lys) ) 
with 2 . 6-di- tert -butyl-4-methylpvridine (10 equiy) and the 
flasks shaken overnight or until a negative Ninhydrin test 
is achieved. The resin is washed once (DCM) and then the 
six batches are combined and washed again (DCM, x5) before 

25 drying in vacuo . 
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Step E 

The dry resin is divided into five reaction vessels and is 
suspended in DCM. The appropriate combinations of 
identifiers IX^.f (see Table 4-1) are^ added to the flasks 
5 and shaken for 1 hr. The Rh(TFA)2 catalyst (1 mol%) is 
added to each flask and shaken for an additional 2 hr. 
The flasks are drained and the resin washed with DCM (x5) . 
The resin in then treated with a solution of TFA in DCM 
(0.01 M) and shaken for 30 min. and is then washed with 
10 DMF (x3) and DCM (x3) . 

To each flask is added a solution of 5% acetic acid in DMF 
and the mixtures are heated to €0°C and shaken overnight. 
The solvent is drained and then the resin washed with DMF 
15 (x5) . 

Step F 

Each batch of resin is suspended in THF and the flasks are 
cooled to -78*»C. To each flask is added a solution of 

20 lithiated 5- (phenylmethyl) -2-oxazolidinone (2 equiv) in 
THF and the mixtures are shaken at -78**C for 1 hr. The 
appropriate alkylating agent (Table 4-2) (4 equiv) is then 
added to each reaction flask followed by a catalytic 
amount of DMF. The vessels are allowed to warm to ambient 

25 temperature and shaken at this temperature for 5 hrs. The 
solvent is removed by filtration and the resin washed with 
THF (xl) and then dried in vacuo . The batches of resin 
are then combined and washed with THF (x2) and DCM (x2) 
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and the combined resin is then treated with a 95:5:10 
mixture of TFA : water :dimethylsulphide for 2 hrs to remove, 
the side chain protecting groups. 



TABLE 


4-2 


IDENTIFIER 


ALKYLATING 
AGENT 


e 


H3CI 


d,e 


CjHsBr 


f 


BrCH2-CH=CH2 ' 


d,f 


BrCHjCgHs 



10 

step G 

The resultant benzodiazepine can be cleaved from a bead of 
polystyrene by suspending the bead in DMF and irradiating 
with U,V. (350 nm) for 12 hrs. 

15 

Step H 

A bead of interest is placed into a glass capillary tube. 
Into the t\ibe is syringed 1 of IM aqueous cerium (IV) 
ammonium nitrate (CAN) solution, 1 /zL of acetonitrile and 
20 2/iL of hexane. The txibe is flame sealed and then 
centrifuged to ensure that the bead is immersed in the 
reagents. The tube is placed in an ultrasonic bath and 
sonicated from 1 to 10 hrs preferably from 2 to 6 hrs. 
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The tube is cracked open and ^1 /iL of the upper hexane 
layer, is mixed with -0.2 fih of bi&(trimethylsilyl) - 
acetamide (BSA) prior to injection into the GC and each 
tag member determined using electron capture detection, as 
5 exemplified in the following scheme. 
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EXAMPLE 5 

117.649 Peptide Library 
An encoded library of 117,649 peptides was prepared. This 
library had the sequence HaN-XXXXXXEEDLGGGG-bead, where the 
5 variable residue X was D,E,I,K,L,Q or S. This library was 
encoded using the 18 tags as defined in Example 3; three 
binary bits being sufficient to represent the seven amino 
acids used in each step. The code was: 001=S; 010=1; 
011=K; 100=L; 101=Q; 110=E; and 111=D, where 1 indicates 
10 the presence and 0 indicates the absence of a tag. 

The constant segment of the library (HjNEEDLGGGG-bead) was 
synthesized on 1.5 g of 50-80 ^ Merrifield polystyrene 
synthesis beads functionalized with 1.1 mEq/g of 

15 aminomethyl groups using standard solid phase methods 
based on t-Bu sidechain protection and Fmoc mainchain 
protection. After deprotecting the N- terminal Fmoc 
protecting group with diethylamine, the beads were divided 
into seven 200 mg portions, each portion being placed into 

20 a different Merrifield synthesis vessel mounted on a 
single wrist-action shaker. 

The beads in the seven vessels were processed as in Table 
3-1 to attach the sets of identifiers (T1-T3) and the 
25 corresponding amino acid to each portion except that 
instead of DIG, i-butylchloroformate was used for 
activation. 
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This procedure first chemically attached small amounts of 
appropriate identifiers via their carboxylic acids to the 
synthesis beads. This attachment was achieved by 
activating the linker carboxyl groups as mixed carbonic 
5 anhydrides using isob utylchlorof ormate . and then adding an 
amount of activated identifier corresponding to 1% of the 
free amino groups attached to the beads. Thus^ about 1% 
of the free amino groups were terminated for each 
identifier added. The remaining free amino groups were 
10 then coupled in the usual way with the corresponding 
protected amino acids activated as their . symmetrical 
anhydrides. 

After washing, the seven portions were combined and the 
15 Fmoc protected amino groups were deprotected by treatment 
with diethylamine . The beads were again divided into 
seven portions and processed as before, except that the 
appropriate identifiers carrying tags T4, T5, and T6 were 
added to the reaction vessels . 
20 The procedure of dividing, labelling, coupling the amino 
acid combining and main- chain deprotection was carried out 
a total of six times using identifiers bearing tags Tl- 
T18, affording an encoded peptide library of 117,649 
different members. 

25 
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Tvpical Identifier Preparation 

To a solution of 8-bromo-l-octanol (0.91 g, 4.35 mmol) and 
2,4,6-trichlorophenol (1.03 g, 5.22 mmol) in DMF (5 mL) 
was added cesium carbonate. (1.70 g, 5.22 mmol) resulting 
5 in the evolution of gas and the precipitation of a white 
solid. The reaction was stirred at 80° C for 2 hrs. The 
mixture was diluted with toluene (50 mL) and poured into 
a separatory funnel, washed with 0.5 N NaOH (2x50 mL) , IN 
HCl (2x50 mL) and water (53 mL) and the organic phase was 
10 dried (MgS04) . Removal of the solvent by evaporation gave 
1.24 g (87% yield) of tag as a clear oil. 

The above tag (0.81 g, 2.5 mmol) was added to a 2 M 
solution of phosgene in toluene (15 mL) and stirred at 

15 room temperature for 1 hr. The excess phosgene and the 
toluene were removed by evaporation and the resulting 
crude chlorof ormate was dissolved in DCM (5 mL) and 
pyridine (0.61 mL, 7.5 mmol). tert-Butyl 4-hydroxy- 
methyl-3-nitrobenzoace (Barany and Albericio, J. Am. Chem. 

20 Soc, (1985), 107, 4936-4942) (0.5 g, 1.98 mmol) was added 
and the reaction mixture stirred at room temperature for 
3 hrs. The solution was diluted with ethyl acetate (75 
mL) and poured into a separatory funnel. After washing 
with IN HCl (3x35 mL) , saturated NaHCOa (2x35 mL) and brine 

25 (35 mL) , the organic phase was dried (MgSO^) . The solvent 
was removed by evaporation and the residue purified by 
chromatography on silica gel (5% to 7.5% ethyl acetate in 
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petroleum ether) affording 0.95 g (79% yield) of the 
identifier ter t-butyl ester as a clear oil. 

Trif luoroacetic acid (3 mL) was added to a solution of the 
5 identifier tert-butyl ester (0,95 g, 1.57 tnmol) in DCM (30 
mL) to deprotect the linker acid (i.e., pi-ps of Formula I) 
and the solution was stirred at room temperature for 7 
hrs . The mixture was then evaporated to dryness and the 
residue redissolved in DCM (30 mL) . The solution was 
10 washed with brine (20 mL) and the organic phase dried 
(MgSO^) . Removal of the solvent by evaporation gave 0.75 
g (87% yield) of the identifier (6B) as a pale yellow 
solid. (Tag nomenclature is the same as in Example 3) . 

15 Typical Encoded Library Synthesis Step 

Na-Fmoc-E(tBu) -E (tBu) -D(tBu) -L-G4-NH-resin was suspended 
in DMF (20 mL) and shaken for 2 min. After filtering, 1:1 
diethylamine : DMF (40 mL) was added to remove the Fmoc 
protecting groups and the resin was shaken for 1 hr. The 

20 resin was separated by filtration and washed with DMF 
(2x20 mL, 2 min each); 2:1 dioxane: water (2x20 mL, 5 min 
each) , DMF (3x20 mL, 2 min each) , DCM (3 x 20 mL, 2 min 
each) then dried in vacuo at 25** C. (The resin was found 
to have 0.4 mmol/g amino groups by picric acid titration 

25 at this stage.) 

150 mg Portions of the resin were placed into seven 
Merrifield vessels and suspended in DCM (5 mL) . The 
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appropriate identifiers were activated as their acyl 
carbonates as follows (for the first coupling) : Tl {6.6 
mg, 0.0098 mmol) was dissolved in anhydrous ether (2 ttiL) 
and pyridine (10 fiW was added. Isobutyl chlorof ormate 
5 (1.3 ^L, 0.0096 mtnol) was added as a solution in anhydrous 
ether (0.1 mL) . The resulting mixture was stirred at 25** 
C for 1 hr. during which time a fine white precipitate 
formed. The stirring was stopped and the precipitate was 
allowed to settle for 30 min. Solutions of the 

10 acylcarbonates of T2 and T3 were prepared in the same way. 
Aliquots (0.25 mL) of the supernatant solution of 
activated identifiers were mixed to give the appropriate 
3 -bit binary tag codes and the appropriate coding mixtures 
of identifiers were added to each of the seven synthesis 

15 vessels. The vessels were shaken in the dark for 12 hrs, 
and then each was washed with DCM (4x10 mL, 2 min each) . 
A solution of the symmetrical anhydride of an Na-Fmoc 
amino acid in DCM (3 equivalents in 10 mL) was then added 
to the corresponding coded batch of resin and shaken for 

20 20 min. 5% N,N-diisopropylethylamine in DCM (1 mL) was 
added and the mixture shaken iintil the resin gave a 
negative Kaiser test. 

The resin batches were filtered and combined, and then 
25 washed with DCM (4x20 mL, 2 min each) , isopropanol (2x20 
mL, 2 min each) , DCM (4x20 mL, 2 min each) . The next 
cycle of labelling/coupling was initiated by Fmoc 
deprotection as described above. 
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After Fmoc deprotection of the residues in the last 
position of the peptide, the side chain functionality, was 
deprotected by suspending the resin in DCM (10 mL) , adding 
thioanisole (2 mL) , ethanedithiol (0.5 mL) and tri- 
5 fluoroacetic acid (10 mL) then shaking for 1 hr at 25** C. 
The resin was then washed with DCavi (6x20 mL, 2 min each) 
and dried. 

Electron Capture Gas Chromatography Reading of Code 

10 A single, selected bead was placed in a Pyrex capillary 
tube and washed with DMF (5x10 fiL) . The bead was then 
suspended in DMF (1 /iL) and the capillary was sealed. The 
suspended bead was irradiated at 366 nm for 3 hrs to 
release, the tag alcohols, and the capillary tube 

15 subsequently placed in a sand bath at 90^ C for 2 hrs. 
The tube was opened and bis-trimethylsilyl acetamide (0.1 
mL) was added to trimethylsilylate the tag alcohols. 
After centrifuging for 2 min., the tag solution above the 
bead (1 /iL) was injected directly into an electron capture 

20 detection, capillary gas chromatograph for analysis. Gas 
chromatography was performed using a Hewlett Packard 
Series II Model 5890 gas chromatograph equipped with a 0.2 
mmx20 m methylsilicone fused silica capillary column and 
an electron capture detector. Photolysis reactions were 

25 performed using a UVP "Black Ray" model UVL 56 hand-held 
366 nm^ lamp. 
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Aatibodv Affinity Methods 

The anti-C-myc peptide monoclonal antibody 9E10 was 
prepared from ascites fluid as described in Evans et al . . 
Mol, Cell Biol., 5, 3610-3616 (1985) and Munro and Pelham, 
5 Cell, 48/ 899-907 (1987) . To test beads for binding to 
9E10, beads were incubated in TEST [20 mM Tris-HCl (pH 
7.5) , 500 mM NaCl and 0.05% Tween-20] containing 1% bovine 
serum albumin (BSA) to block non-specific protein binding 
sites. The beads were then centrifuged, resuspended in a 

10 1:200 dilution of 9E10 ascites fluid in TEST + 1% BSA and 
incubated overnight at 4 **C . Beads were subsequently 
washed three times in TEST and incubated for 90 min. at 
room temperature in alkaline phosphatase -coupled goat 
antimouse IgG antibodies (Bio-Rad Laboratories) , diluted 

15 1:3000 in TEST + 1% BSA. After washing the beads twice in 
TEST and once in phosphatase buffer (100 mM Tris-HCl, pH 
9.5, 100 mM NaCl and 5 mM MgCla) / the beads were incubated 
1 hr at room temperature in phosphatase buffer containing 
one one-hundreth part each of AP Color Reagents A & B 

20 (Bio-Rad Laboratories) . To stop the reaction, the beads 
were washed twice in 20 mM sodium EDTA, pH 7.4. Solution 
phase affinities between 9E10 and various peptides were 
determined by a modification of the competitive ELISA 
assay described by Harlow et al . , Antibodies: a Laboratory 

25 Manual, 570-573, Cold Spring Harbor Press, Cold Spring 
Harbor,.. N.Y. 
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From a 30 mg sample of the combinatorial library of 
peptides, 40 individual beads were identified which 
stained on exposure to the anti-C-myc monoclonal antibody. 
Decoding of these positive-reacting beads established the 
5 ligand's reaction sequence as the myc epitope (EQKLISEEDL) 
or sequences that differed by one or two substituents 
among the three N-terminal residues. 

EXAMPLE 6 

10 23.540.625 Mixed Amide Library 

The encoding technique was tested further by the 
preparation of a combinatorial library of 23,540,625 
members consisting of peptides and other amide compounds. 

15 The synthesis was carried out using 15 different reagents 
in 5 steps and 31 different reagents in the sixth step. 
Four identifiers were used to encode each of the 5 steps 
with 15 reagents and five identifiers were used in the 
final step with 31 reagents. A label set of 25 

20 identifiers was therefore prepared. 2-Nitro-4- 
carboxybenzyl, O-aryl substituted cj-hydroxyalkyl carbonate 
identifiers were employed, where the tag components were 
comprised of an alkyl moiety of from 3 to 12 carbon atoms 
and the aryl moieties were (A) pentachlorophenyl , (B) 

25 2,4,5-trichlorophenyl, (C) 2,4, 6-trichlorophenyl, or (D) 
2, 6-dichloro-4-fluorophenyl. . A set of 25 tags was 
prepared using appropriate alkyl chains lengths with A, B, 
C or D, separable using a 0.2 mMx25M methylsilicone GC 



wo 95/28640 



PCr/DS95/04683 



-102- 

column. The chemical compositions of tags T1-T25 (where 
Tl represents the tag with the longest retention time, and 
T25 the tag with the shortest retention time) are 
summarized below: 



Tl 


lOA 


T6 


IOC 


Til 


7B 


T16 


5C 


T21 


2B 


T2 


9A 


T7 


9B 


T12 


7C 


T17 


4B 


T22 


2C 


T3 


8A 


T8 


9C 


T13 


6B 


TIB 


4C 


T23 


IB 


T4 


7A 


T9 


8B 


T14 


6C 


T19 


3B 


T24 


IC 


T5 


lOB 


TIO 


BC 


T15 


5B 


T20 


3C 


T25 


2D 



10 

The designations lOA, 9A, etc. are as described in Example 
3. 

The fifteen reagents used in the first five stages and the 
15 code identifying them are represented below where 1 
represents the presence of tag and O the absence thereof . 
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REAGENT 


CODE 


L-serine 


(0001) 


D- serine 


(0010) 


L-glutamic acid 


(0011) 


D-glutamj c acid 


(0100) 


L-glutamine 


(0101) 


D-glutamine 


(0110) 


L- lysine 


(0111) 


D- lysine 


(1000) 


L-Proline 


(1001) 


D- Proline 


(1010) 


L-phenylalanine 


(1011) 


D-phenylal?jiine 


(1100) 


3 - amino - r^enzoic 
acid 


(1101) 


4 -aminophenyl 
acetic acid 


(1110) 


3,5-diamino- 
benzoic acid 


(1111) 
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The 31 reagents and the code representing them in the 
sixth stage are represented below: 



10 



15 



20 



REAGENT 


CODE 


L- serine 


(00001) 


D- serine 


(00010) 


L-glutamic acid 


(00011) 


D-glutamic acid 


(00100) 


L-glutamine 


(00101) 


D-glutamine 


(00110) 


L- lysine 


(00111) 


D- lysine 


(01000) 


L-proline 


(01001) 


D-proline 


(01010) 


L-phenylalanine 


(01011) 


D - pheny la 1 anine 


(01100) 


3 -amino-benzoic acid 


(01101) 


4-aminophenyl acetic acid 


(OHIO) 


3 , 5-diamino-benzoic acid 


(01111) 


Succinic Anhydride 


(10000) 


Tiglic acid 


(10001) 


2-pyrazine carboxylic acid 


(10010) 
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(±)thioctic acid 


(10011) 


1-piperidinepropionic acid 


(10100) 


piperonylic acid 


(10101) 


6-Tnethylnicotinic acid 


(10110) 


3- (2-thienyl) acrylic acid 


(10111) 


methyl iodide 


(11000) 


tosyl chloride 


(11001) 


p-toluenesulfonyl isocyanate 


(liolo) 


3-cyanobenzoic acid 


(11011) 


phthallic anhydride 


(11100) 


acetic anhydride 


(11101) 


ethyl chloroformate 


(11110) 


mesylchloride 


(11111) 



15 A spacer of six glycine units was prepared on the beads 
using standard methods. The variable region was 
constructed using butyl sidechain protection, and amino 
groups were protected as Fmoc derivatives. Amide bonds 
were formed by activation of the carboxylic acid with DIG 

20 and HOBt. 
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EXAMPLE 7 

Hetero-Diels-Alder Library 
A combinatorial hetero Diels -Alder library comprising 42 
compounds of the formula: 



10 



15 




wherein; 

is H, CH3O, F3C, F3CO, HsCfiO, or CgHii; 
is H, CH3, or CH3O; 
R^ is H (when n=2) , or CH3 (when n=l) ; and 



Ar = 



R = H or CI or Ar = 



00 
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STEP C 
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0 

IV 



1) Toluene 


2 


A 






R T 




2 

R 


2) Identifiers X 






STEP D 




BF3. EtgO 
DCM 

STEP E 



wo 95/28640 



-109- 



PCT/US95/04683 




wo 95/28640 PCr/US95/04683 

-110- 

The azatricyclic products {VI ) were constructed on 
polystyrene beads and were linked to the beads by a 
photocleavable linker allowing the azatricycle {VII ) to be 
removed from the bead by exposure to U.V. light {350 nm in 
5 DMF) . The binary codes introduced in steps C,D and E 
allow a unique determination of the reaction sequence used 
to introduce ArR, R\ and R^ The encoding tags were 
removed according to seep G and analyzed by electron 
capture detection follov:irg GC separation . 



10 



15 



The identifiers used in this scheme are represented by the 
formula X: 

CI 

CI 



"2 T CI 



0 



Wherein; 

20 indicates n=10 

Xu indicates n=9 
Xc indicates n=8 
Xd indicates n=7 
Xe indicates n=6 

25 Xf indicates n=5 
Xg indicates n=4 



X 



The codes for each of R, RS R^, R^ are as follows: 
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TABLE 7-1 



Ar = 




R = H 



10 b 



Ar = 




R = CI 



15 a,b 



Ar = 




R^=H R2=H 



20 d 



R^=H R*=CH3 



R^=0CH3 R^sOCHj 



25 



R^=CF, R'=H 



e,c 



R^sCgHjO R2=H 



e,d 



R^=F,CO R2=H 
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f R3=CH3 n=l 

5 g R^=H n=2 

Step A 

To a solution of I (2.03 g, 8 mmol) , 4-hydroxybenzaldehyde 
(1.17 g, 9.6 mmol) and triphenylphosphine (2.73 g, 10.4 
imnol) in toluene (20 mL) stirring at O^C was added over a 

10 period of 30 minutes diethylazodicarboxylate . The 
solution was allowed to warm and stirred for 1 hour once 
ambient temperature had been reached. The solution was 
concentrated by removal of approximately half of the 
solvent jji vacuo and was then triturated with ether • The 

15 mixture was then filtered and the residue was washed 
thoroughly with ether. The solvent was removed in vacuo 
and the residue was purified by chromatography on silica 
gel (15% ethyl acetate in hexane) affording 1,3 g of the 
ether Ila (47% yield) . 

20 

2 -chloro-4 -hydroxybenzaldehyde and 2 -hydroxy- 1 - 
naphthaldehyde were coupled to I in analogous fashion 
affording ethers lib and c in yields of 91% and 67%, 
respectively. 

25 

Step B 

To a solution of ether Ila (0*407 g, 1.14 mmol) in DCM (20 
mL) stirring at room temperature was added TFA (8 mL) . 
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The solution was allowed to stir for 6 hrs. The solution 
was evaporated to dryness in vacuo affording 0.343 g of 
acid Ilia (100% yield) . Ethers lib and lie were 
deprotected analogously affording acids I lib and c in 
5 yields of 92% and 100% respectively. 

Step C 

Into a peptide reaction vessel (Merrifield vessel) were 
measured 1% DVB (divinylbenzene) cross-linked polystyrene 

10 beads (50-80/i) f una tionali zed with aminotne thy 1 groups (1.1 
nieq/g) (200 mg of resin) . The resin was suspended in DMF 
(2 mL) and shaken for 20 min. The acid Ilia (38 mg, 2 
equiv.), l-hydroxybenzotriazole (40 mg, 2 equiv) and 
diisopropylcarbodiimide (38 mg, 2 equiv) were added and 

15 the mixture shaken until a negative Ninhydrin test was 
achieved (22 hr) . The solution was removed by filtration 
and the resin was washed with DCM (8x 10 mL) . 

The resin was resuspended in DCM (5 mL) , identifier Xa (15 
20 mg) was added and the flask was shaken for 1 hr. Rh(TPA)2 
catalyst (1 mol%) was added and the flasks shaken for 2 
hrs. The solvent was removed by filtration and the resin 
resuspended in DCM (5 mL) . Trif luoroacetic acid (1 drop) 
was added and the vessel shaken for 20 min. The solvent 
25 was removed by filtration, and the resin was washed with 
DCM (8x 10 mL) . 
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In an analogous fashion, acids Illb and IIIc were attached 
to the resin and were encoded with the appropriate 
identifiers, i.e., Xb for acid Illb and Xa and Xb for acid 
IIIc. The three batches of resin were combined, mixed, 
5 washed, and dried. 

Step D 

The dry resin was divided into 7 equal portions {87 mg) 
which were put into seven peptide reaction vessels 
10 (Merrifield vessels) which were wrapped with heat tape. 
The resin in each vessel was suspended in toluene (10 mL) 
and shaken for 20 min. An appropriate amount of one 
aniline was then added to each flask (see Table 7-2) . 

15 TABLE 7-2 



FLASK 


ANILINE 


AMOUNT ADDED 


1 


Aniline 


3 mL 


2 


3 , 5 -dimethyl aniline 


3 mL 


3 


3,4, 5-trimethoxyaniline 


2 g 


4 


4 - trif luoromethylaniline 


3 mL 


5 


4 -phenoxyaniline 


2 g 


6 


4 - 1 r i f luorome t hoxyani 1 ine 


3 mL 


7 


4 - eye lohexy 1 ani 1 ine 


2 g 



25 The heating tape was connected and the reaction 

mixtures shaken at 70 ^'C for 18 hrs. The heat tape was 
disconnected and the solvent was removed by filtration and 
* each batch of resin was washed with dry DCM {4x 10 mL) , 
ethjsr (10 mL) , toluene (10 mL) and DCM (2x 10 mL) . Each 

30 of the portions was then suspended in DCM (5 mL) and to 
each flask was added the appropriate identifier or 
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combination of identifiers (Xc-e) (15 mg) (see Table 7-1) . 
The flasks were shaken for 1 hr. and then Rh(TFA)2 (1 mol%) 
was added to each flask and shaking continued for 2 hrs. 

5 The solvent was then removed and each batch of resin was 
re-suspended in DCM (5 mL) to which was added TFA (1 
drop). This mixture was shaken for 20 rain., then the 
solvent was removed by filtration. The batches of resin 
were then washed (DCM, Ix 10 mL) and combined, washed 
10 again with DCM (3x 10 mL) and then dried thoroughly in 
vacuo. 

Step E 

The dried resin was divided into two equal portions (0.3 
15 g) and each was placed in a peptide reaction vessel. The 
resin batches were washed with DCM (2x 10 mL) and then 
resuspended in DCM (5 mL) . To one flask was added the 
identifier Xf (15 mg) and to the other was added Xg (15 
mg) . The flasks were shaken for 1 hr. prior to the 
20 addition of Rh{TFA)2 catalyst (1 mol%) . The flasks were 
shaken for 2 hrs. and then the solvent was removed by 
filtration. Each batch of resin was washed with DCM (3x 
10 mL) , and each was then resuspended in DCM (5 mL) . 

25 The appropriate enol ether (1 mL) (see Table 7-1) was added 
to the flasks and the vessels shaken for 30 min. To each 
flask was added a solution of BFa^OEta (0.5 mL of a 5% 
solution in DCM) and the flasks were shaken for 24 hrs. 
Removal of the solvent by filtration was followed by 

30 washing of the resin with DCM (10 mL) and the resin was 
then combined. The beads were then washed further with 
DCM (5x 10 mL) , DMF (2x 10 mL) methanol (2x 10 mL) and DCM 
(2x 10 mL) . The resin was then dried thoroughly in vacuo . 

35 gt:ep F 

To confirm the identity of the products produced in the 
Hetero-Diels-Alder library one example was completed on a 
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large scale to allow confirmation of the structure by 
spectroscopic means. The procedure followed was 

essentially the same method as described for the 
combinatorial library. In step A 4-hydroxyben2aldehyde 
5 was coupled to the photolabile group. In step D, aniline 
was condensed with the aldehyde. In step E, the enol 
ether was formed with 4 , 5-dihydro-2-methylfuran. 

The photolysis of the compound (step F) was performed by 
suspending 100 mg of the beads in DMF (0.3 mL) and 
irradiating the beads with UVP "Black Ray" model UVL 56 
hand-held 366 nm lamp for 16 hrs . The DMF was removed to 
one side by pipette and the beads rinsed with additional 
DMF (2x 3 mL) . The original solution and the washings 
were combined and the solvent removed in vacuo . NMR 
analysis of the reaction mixture showed it to contain the 
desired azatricycle by comparison to the authentic sample. 

Step G 

20 A bead of interest was placed into a pyrex glass capillary 
tube sealed at one end. A solution (1 ^L) of IM aqueous 
cerium (IV) ammonium nitrate and acetonitrile (1:1) was 
syringed into the tube, and the tube was then centrifuged 
so that the bead lay on the bottom of the capillary and 

25 was completely immersed by the reagent solution. Hexane 
(2 liL) was added by syringe and the tube was again 
centrifuged. The open end of the capillary was flame- 
sealed and placed in an ultrasonic bath for 4 hrs. The 
capillary was then placed inverted into a centrifuge and 

30 spun such that the aqueous layer was forced through the 
hexane layer to the bottom of the tube. This extraction 
process was repeated 3 or 4 times and the tube was then 
opened. The hexane layer (1.5 (xh) was removed by syringe 
cuid placed intp a different capillary containing BSA (0.2 

35 ^L) , This tube was sealed and centrifuged until the 
reagents were thoroughly mixed. A portion of the solution 
(ca. 1 (iL) was removed and injected into a gas 



10 



15 
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chromatography machine with a 25M x 0.2 mM methylsilicone 
fused silica column with electron capture detection for 
separation and interpretation of the tag molecules. 



5 The sample was injected onto the GC column at 200<*C and 25 
psi of carrier gas (Hea) . After 1 minute the temperature 
was increased at a rate of 20°C per minute to 320°C, and 
the pressure was increased at a rate of 2 psi per minute 
to 40 psi. These conditions are shown in the following 
10 diagram: 

GC CONDITIONS 



TEMPERATURE 



20 




1 min 



25 



PRESSURE 



40 psi 



25 psi 



2 psi per min 



30 



1 min 



I 
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The following results were obtained with four randomly 
selected beads: 

Bead 1 





TAG DETECTED 




Xf 


Xe Xd Xc 


Xb Xa 


Ar 






2 -Hydroxy naphthyl 














H 




R^ 


CH3 (n=l) 







10 

Bead 2 





TAG DETECTED 




Xg 


Xe Xd Xc 


Xb 


Ar 






2 - chl oro - 4 - hydr oxyphenyl 


R* 




CjHii 




R2 




H 




R' 


H (n=2) 







Bead 3 





TAG DETECTED 




Xg 


Xe Xd 


Xb Xa 


Ar 






2 -Hydroxy naphthyl 


R^ 




F3CO 




R2 




H 




R^ 


H (n=2) 







Bead 4 





TAG DETECTED 




Xf 


Xe Xd 


Xb 


Ar 






2 - chloro - 4 - hydroxypheny 1 


R^ 




F3CO 




R2 




H 




R' 


CHj (n=l) 
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EXAMPLB 8 

Benzodiazepine Library 

Following the procedure of Example 4, a combinatorial 
5 library is constructed of the Formula X 



2 




wherein 

20 R is a radical of a naturally occurring D or L amino acid; 

is H, Ci-Cfi alkyl, lower alkenyl, Ci-Cg alkylamine, 

carboxy Ci-Cg alkyl, or phenyl Ci-Cg alkyl wherein the 

phenyl is optionally substituted by lower alkyl, F, Cl, 

Br, OH, NHj, CO2H, or O- lower alkyl; 
25 r2 is H or CO2H; 

R^ is H or OH; 

R* is H or CI; 

with the provisos that when R^ is OH, R^ is H and when R^ 
is carboxy, R^ is H. 

30 
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This library is released from a plurality of encoded beads 
of the general formula 



10 




15 wherein 

IXn is a plurality of identifiers of the Formula la wherein 

said plurality represents an encoded scheme; 
S is a substrate; 

pi'-pz is the residue of the linker member of Formula la; 
20 and 

R, R^, R^, and R* are as defined for Formula X. 



\ 
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EXAMPLE 9 

Typical Identifie r Preparations 
The diazo compoiind identifiers which are attached to the 
resin via carbene formation are prepared as exemplified. 

5 

Compounds of the general formula 



10 




15 

wherein 

n is 0-10 and 

Ar is pentachlorophenol , 2,4, 6 - trichlorophenol , 
20 2,4, 5 -trichlorophenol , or 2 , 6-dichloro-4 -f luorophenol 

are prepared as follows. 

To a solution of l-hydroxy-4- (2 , 6-dichloro-4-f luoro- 
phenoxy) butane (0.38 g, ,1.5 mmol) , isovanillin (0.228 g, 

25 1.5 mmol) and triphenylphosphine (0.393 g, 1.5 mmol) in 
THF (8 mL) was added diethylazodicarboxylate (0.287 g, 1.7 
mmol) . The solution stirred at r.t. for 36 hrs. The 
solvent was removed in vacuo and the residue purified by 
chromatography on silica gel (with a mixture of 20% ethyl 

30 acetate and 80% petroleum ether) affording 0.45 g of the 
aldehyde (77% yield) . . 

The aldehyde (100 mg, 0.26 mmol) was dissolved in acetone 
. (8 mL) and was treated with a solution of KMn04 (61 mg, 
35 0.39 mmol) in acetone (4 mL) and water (4 mL) . The 
reaction stirred at room temperature for 13 hrs. The 
mixture was diluted with ethyl acetate (100 mL) and water 
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(50 mL) and the layers were separated. The aqueous layer 
was extracted with additional ethyl acetate {2x 100 mL) . 
The combined organic layers were washed with water (50 mL) 
and dried (MgSOJ . Removal of the solvent afforded 109 mg 
5 of the benzoic acid (93% yield) . 

A solution of the acid {76^ mg, 0.188 mmol) in methylene 
chloride (2 mL) was treated with oxalylchloride (36 mg, 
0.28 mmol) and catalytic DMF. After stirring for 10 min 

10 at room temperature slow but steady evolution of gas was 
observed. Stirring continued for 2 hrs. when the solution 
was diluted with DCM (15 mL) and washed . with saturated 
aqueous sodium hydrogencarbonate solution (5 mL) . The 
layers were separated. The organic layer was dried 

15 (NaaSO^) and the solvent evaporated affording the benzoyl 
chloride as pale yellow crystals. 

The benzoyl chloride was dissolved in methylene chloride 
(5 mL) and was added to a stirring solution of an excess 

20 of diazome thane in ether at -78®C, The cold bath was 
allowed to warm up and the mixture allowed to stir for 5 
hrs at room temperature. The solvents and excess 
diazomethcuie were removed in vacuo and the residue 
purified by chromatography on silica gel using gradient 

25 elution method where the concentration of ethyl acetate 
ranged from 10% to 40 '% in hexanes affording 48 mg of the 
diazo compound (60% yield) . 
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Compounds of the general formula: 



5 




10 



wherein; 
15 n is 0-10 and 

Ar is pentachlorophenol, 2,4, 6-trichlorophenol, 2,4,5- 

trichlorophenol, or 2 , 6-dichloro-4-f luorophenol 
are prepared as follows . 

20 Methyl vanillate (0.729 g, 4.0 mmole) , l-hydroxy-9- 
(2,3,4,5, 6 -pentachlorophenoxy)nonane (1.634 g, 4.0 mmole) 
and triphenylphosphine (1.259 g, 4.8 mmole) were dissolved 
in 20 mL dry toluene under argon. DEAD (0.76 mL, 0.836 g, 
4.8 mmole) was added dropwise, and the mixture was stirred 

25 at 25°C for one hour. The solution was concentrated to 
half volume and purified by flash chromatography eluting 
with DCM to give 1.0 g (1.7 mmole, 43%) of the product as 
a white crystalline solid. 

30 The methyl ester above (1.0 g, 1.7 mmole) was dissolved in 
50 mL THF, 2 mL water was added followed by lithium 
hydroxide (1.2 g, 50 mmole). The mixture was stirred at 
25 ®C for one hour then refluxed for five hours. After 
cooling to 25<*C the mixture was poured onto ethyl acetate 

35 (200 mL) * and the solution was washed with 1 M HCl (50 mL 
x3) then sat. aq. NaCl (Ix 50 mL) and dried over sodium 
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sulfate. The solvent was removed and the crude acid 
azeotroped once with toluene. 

The crude material above was dissolved in 100 mL toluene, 
5 10 mL (1.63 g, 14 mmole) thionyl chloride was added, and 
the mixture was refluxed for 90 min. The volume of the 
solution was reduced to approximately 30 mL by 
distillation, then the remaining toluene removed by 
evaporation in vacuo . The crude acid chloride was 

10 dissolved in 20 mL dry DCM and cooled to -78<*C under argon 
and a solution of approximately 10 mmole diazomethane in 
50 mL anhydrous ether was added. The mixture was warmed 
to room temperature and stirred for 90 min. Argon was 
bxibbled through the solution for 10 min. then the solvents 

15 were removed by evaporation in vacuo and the crude 
material was purified by flash chromatography eluting with 
10-20% ethyl acetate in hexane. The diazoketone (0.85 g, 
1.4 mmole, 82% over three steps) was obtained as a pale 
yellow solid. 

20 

The following identifiers have been prepared as described 
above : 

Photolablle Cleavage 

25 50 Identifiers were prepared of the formula: 



30 



35 




wherein: 
Ar is 
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and n is 1,2,3,4,5,6,7,8,9, and 10. 

Oxidative Cleavage Type I 

5 7 Identifiers were prepared of the formula 



10 




wherein: 
Ar is 

15 

20 and n is 4,5,6,7,8,9, and 10. 
Oxidative Cleavage Type II 

13 Identifiers were prepared of the formula 

25 

30 wherein: 
Ar is 

CI 




CI 
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and n is 1,2,3,4,5,6,7,8,9,10; 
and wherein: 
3Ar is 




CI 

10 and n is 0,3, and 9. 



15 
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Exasple 10 

Encoding Combinatorial Libraries with Tags Readable bv 
Mass Spectroscopy 

The tags 4, 11 and 13 (Scheme 8) of the same structure, 
5 but different molecular weights due to varying deuterium 
substitution, were each synthesized (Schemes 9 and 10) and 
separately analyzed by mass spectroscopy (MS) . Among MS 
techniques, positive chemical ionization mass spectroscopy 
(PCIMS) gave minimal fragmentation of the tag, such that 

10 only the molecular ion ([M+NH4]*) and one other fragment 
([MH-HjO]*) were evident (Figures 1, 2 and 3), This 
actually allowed the presence or absence of a tag to be 
determined by the observation of two signals, which 
removes any possible ambiguity when analyzing a more 

15 complex mixture . Approximately equal amounts of the three 
tags were then mixed and analyzed by PCIMS (figure 5) . 
Again, the two signals corresponding to each separate tag 
could easily be distinguished. 

20 Tag 4 was now transformed into the diazoketone precursor 
8 (Scheme 9) , then attached to Tentagel resin as 9 (Scheme 
12) . One bead of the 9 was subsecjuently removed and 4 
oxidatively released using eerie ammonium nitrate. PCIMS 
analysis again clearly showed the presence of tag 4. 

25 

In summary, the set of tags 4, 11 and 13 of the same 
structure, but different molecular weights were 
synthesized. All were easily detected simultaneously in 
a mixture by PCIMS. The small amoxint of 4 released from 
30 a single bead of Tentagel resin used in combinatorial 
library synthesis is detectable by PCIMS. MS is a viable 
and sensitive detection method for tags, and can be used 
as the basis for an encoding scheme of a combinatorial 
library. 

35 

Analysis of 4, 11 and 13 by PCIMS was obtained using a 
reagent gas mixture of 1% NH3 in CH4. 
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(2) . To a solution of 11.1 mL (125 mmole, 5.00 eq.) of 
1,4-butanediol (1), 6.97 mL (50.0 mmole, 2,00 eq.) of EtjN 
and 0.153 g (1.25 mraole, 0.05 eq.) of 4- 
dimethylaminopyridine in dry CHaClsdOO mL) at 0°C under 

5 Ar, was added 3.88g (25.0 mmole, 1.00 eq.) of 97% tert- 
butyldimethylsilyl chloride. The resulting solution was 
stirred at 0°C for 15 min, then 25°C for 16hours. The 
reaction was then diluted with CH2CI2 (250 mL) and washed 
with 1 M HCl (lOOmL) , saturated aq. NaHCOj (100 mL) and HjO 
10 (100 mL) , then dried (MgS04) . Removal of the volatiles in 
vacuo gave the crude product 2 as an oil. 

(3) • To a solution of -10.0 mmole of crude alcohol 2, 1.93 
g (10.5 mmole, 1.05 eq.) of pentaf luorophenol and 2.89 g 

15 (11.0 mmole, 1.10 eq.) of triphenylphosphine in dry CH2CI2 
(40 mL) at O^C under Ar, was added 1.73 mL (11.0 mmole, 
1.10 eq.) of diethyl azodicarboxylate . The resulting 
orange solution was stirred at 0«> C for 5 min, then 25«>C 
for 15 hours. The reaction was then diluted with CHjClj 

20 (250 mL) and washed with saturated aq. NaaCOj (100 mL) , 
saturated aq. NH4CI (lOOmL) and HjO (100 mL) , then dried 
(MgSO^) . Removal of the volatiles in vacuo and 
purification by flash chromatography (0-20% EtOAc in 
hexanes) gave the product 3 as an oil. 

25 

(4) . To a solution of 1.85 g (5.00 mmole, 1.00 eq.) of 
silyl -protected alcohol 3 in THF (20 mL) at 25 ®C, was 
added 10.0 ML (10.0 mmole, 2.00 eq.) of a 1.0 M solution 
of tetrabutylammonium fluoride in THF. The resulting 

30 orange solution was stirred at 25*>C for 4 hours. Removal 
of the volatiles in vacuo and purification by flash 
chromatography (20-40% EtOAc in hexanes) gave 1.10 g (86%) 
of the product 4 as an oil. 

35 (5). To a solution of 0.800 g (3.125 mmole, 1.00 eq.) of 
alcohol 4, 0.569 g (3.125 mmole, 1.00 eq.) of methyl 
vanillate and 0.984 g (3.75 mmole, 1.20 eq.) of 
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triphenylphosphine in dry CH2CI2 (20 mL) at O^C under Ar, 
was added 0.591 mL (3.75 mmole, 1.20 eq.) of diethyl 
azodicarboxylate . The resulting pale yellow solution was 
stirred at 0°C for 5 min, then 25°C for 19 hours. The 
5 reaction was then diluted with CH2CI2 (100 mL) and washed 
with 1 M NaOH (50 mL) , saturated aq. NH4CI (50 mL) and H2O 
(50 mL) , then dried (MgS04) . Removal of the volatiles in 
vacuo and purification by flash chromatography (20% EtOAc 
in hexanes) gave the product 5 as an oil. 

10 

(6) . To a solution of 3.125 mmole of ester 5 in THF (12 
mL) was added 1.31 g (31.3 mmole, 10.0 eq.) of lithium 
hydroxide monohydrate. MeOH (24 mL) was added to the 
resulting suspension to form a solution, which was stirred 

15 at 25^C for 1 hours, then refluxed for 1 day. Volatiles 
were removed in vacuo, and 1 M HCl then added until 
solution was -pH 1. The white precipitate of product 
which formed was collected and dried to give 0.968 g (76% 
- 2 steps) of 6. 

20 

(7) . To 0.968 g (2.38 mmole, 1.00 eq.) of carboxylic acid 
6 under Ar, was added 2.43 mL of thionyl chloride. The. 
resulting suspension was refluxed for 1.5 hours, after 
which time a yellow solution had foarmed. Volatiles were 

25 removed in vacuo, and the resulting residue azeotroped 
three times with toluene to give the product .7 as 
colorless crystals. 

(8) • To a solution of 2.38 mmole of acid chloride 7 in 1:1 
30 THFrMeCN (20 mL) at 0°C under Ar, was added 1.16 mL (8.33 

mmole, 3.50 eq.) of EtaN followed by 3.57 mL (7.14 mmole, 
3.00 eq.) of a 2.0 M solution of 
(trimethylsilyl)diazomethane in hexanes. The resulting 
yellow solution was stirred at O^C for Ih, then 25<>C for 
35 1 day. The reaction was diluted with EtOAc (150 mL) and 
washed with saturated aq. NaHCOa (2 x 75 mL) and saturated 
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aq. NaCl (2 x 75 mL) , then dried (MgSOj - Removal of the 
volatiles in vacuo gave the crude product 8 as an oil. 

(11) . Using commercially available l,4-butanediol-2,2/ 3/ 3- 
5 (10) in place of 1, and an analogous procedure to that 

described for the transformation 1 into 4, 11 was obtained 
in 41% yield over three steps. 

Using commercially available l,4-butanediol-2,2,3,3,4,4-dB 
10 (10) in place of 1, and an analogous procedure to that 
described for the transformation 1 into 4, 13 was obtained 
in 42% yield over three steps. 

Tag 4 was introduced onto the solid support using 5 to 50% 
15 (w/w vs. resin) precursor diazoketone 8 to give 9 by 
essentially the same procedure given in Example 4; the 
Hetero Diels-Alder library. Tag 4 was also subsequently 
removed from 9 by essentially the same procedure as Step 
G in Example 4 . 

20 

The diazoketones corresponding to tags 11 and 13 are used 
to introduce . these tags onto the solid support, so that 
along with 8, they yield members of a binary encoding set. 
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Schem 8 




F 

4 

M.V. = 256 




I 



wo 95/28640 



-132- 



PCTAJS95/04683 




wo 95/28640 



PCT/US95/04683 



-133- 
Scheme 10 
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Preparation of Deuteriated Taos 





CD3CD20H 



PPh3, DEAD 



OH 



10 



15 



20 



25 



To a solution of PPhj (3.2 g, 12,1 mmol) in THF (3 0 mL) at 
0 »C was added diethyl azodicarboxylate (1.9 ml, 12 mmol) . 
After 10 minutes at 0 ^C, a solution of methyl 4- 
hydroxyphenylacetate (1.66 g, 10 mmol) and dg-ethanol (800 
fiL, 12.3 mmol) in 10 mL THF was added. After stirring at 
25 ®C for 2 hours, the reaction mixture was concentrated 
and purified by flash chromatography to give methyl 4-ds 
ethoxyphenylacetate (1.86 g, 93%) as a colorless oil. 



To a solution of 4-d5 ethoxyphenylacetate (1 g, 5 mmol) in 
dry ethyl ether at 0 was added lithium aluminum hydride 
(380 mg, 10 mmol) . After stirring at 25 °C for 8 hours, 
the reaction mixture was poured into cold 3 M HCl. The 
aqueous solution was then extracted with ethyl ether 
twice. The organic layers were combined, washed with 
brine and dried over MgS04 . Removal of the solvent gave 
4-d5 ethoxyphenethyl alcohol (850 mg, 100%) as a white 




solid. 



35 
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NMR (CDC13) 2.79 (2H, t, J= 6.6 Hz), 3.80 (2H, t, J= 6.5 
Hz), 6.83 {2H, d, J= 8.6 Hz), 7.12 (2H, d, J= 8.6 Hz). 

To a solution of 4-d5 ethoxyphenylacetate (800 mg, 4 mmol) 
5 in dry ether at 0 °C was added lithium aluminum deuteride 
(340 mg, 8 mmol) . After stirring at 25 <*C for 8 hours, 
the reaction mixture was poured into cold 3 M HCl. The 
aqueous solution was then extracted with ethyl ether 
twice. The organic layers were combined, washed with 
10 brine and dried over MgS04. Removal of the solvent gave 4- 
dg ethoxy dj-phenethyl alcohol (680 mg, 98%) as a white 
solid. 

H^ NMR (CDC13) 2.79 (2H, s) , 6.85 (2H, d, J= 8.5 Hz), 7.13 
15 (2H, d, J= 8.5 Hz) . 

Scheme 12 displays a table of 7 different tags by 
combination of the compound in the horizontal column with 
the compound in the corresponding vertical column. 



wo 95/28640 



PCT/US95/04683 



-137- 
Scheme 12 



Mass Tag 

CH3CH2OH CD3CH2OH CD3CD2OH 

OH 




Mass=M+2 
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Assessment of d-7 or d-5 deuterium samples 



OH OH 



5 




OCD2CD3 OCD2CD3 

d d 

5 7 



Sensitivity requires that the samples are derivatized to 
form the trimethyl silyl esters using derivatizing agent 

15 bis (trimethylsilyl) trif luoroacetamide (BSTFA) 
quantitatively for 30 minutes at room temperature. The 
reagent and solvent are removed at room temperature under 
a stream of (g) . The samples are resolvated with ethyl 
acetate and injected on column for analysis using positive 

20 chemical ionization mass spectrometry. The d-7 and d-5 
components cannot be separated on the GC but mass 
spectrometry allows the ratio of the two components 
present to be determined when approximately 20 ng is 
injected on column using the Hewlett-Packard GC/MS. The 

25 (M+NH4)* ion observed in Figure 6 for the mass spectrum of 
the derivativized d-5 and d-7 sample show a m/z = 263 for 
d-7 and m/z ^ 261 for d-5 which agree with the calculated 
ratio. Fragments for the para-ethoxy-benzyl cation are 
also observed in Figure 6 at m/z = 156 for d-7 and m/z = 

30 154 for d-5. Sensitivity may be increased to 1 ng or less 
injected on column if the more sensitive JEOIi SX-102 high 
resolution mass spectrometer were used. The JEOL SX-102 
is about 2 orders of magnitude more sensitive than the 
Hewlett-Packard system. Figure 6 shows spectra and 

35 chromatograms showing the improvement in chromatography 
due to derivatization. Figure 5 shows mass spectrum and 
chromatogram for the underivativzed d-7 sample only. 
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It is evident from the above description that the subject 
invention provides a versatile, simple method for 
identifying compounds, where the amount of compound 
present precludes any assurance of the ability to obtain 
5 an accurate determination of its reaction history. The 
method allows for the production of extraordinarily large 
numbers of different products, which can be used in 
various screening techniques to determine biological or 
other activity of interest. The use of tags which are 

10 chemically inert under the process conditions allows for 
great versatility in a variety of environments produced by 
the various synthetic techniques employed for producing 
the products of interest. The tags can be readily 
synthesized and permit accurate analysis, so as to 

15 accurately define the nature of the composition. 

All publications and patent applications cited in this 
specification are herein incorporated by reference as if 
each individual publication or patent application were 
20 specifically and individually indicated to be incorporated 
by reference. 

Although the foregoing invention has been described in 
some detail by way of illustration and example for 
25 purposes of clarity of understanding, it will be readily 
apparent to those of ordinary skill in the art in light of 
the teachings of this invention that certain changes and 
modifications may be made thereto without departing from 
the spirit or scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1. A method for recording the reaction history of a 
reaction series on each of a plurality of unique 
solid supports, wherein said reaction series involves 
5 at least two stages requiring differing agents or 

reaction conditions resulting in a different 
modification as to a plurality of said unique solid 
supports, resulting in a plurality of different final 
products on different unique solid supports, 

10 employing a combination of identifiers for recording 

said reaction history, said identifiers characterized 
by defining the choice of agent or reaction condition 
and the stage in said reaction series and being 
capable of being analyzed as to the choice and stage, 

15 said method comprising: 

reacting, at a first or intermediate stage of said 
series, a different agent or employing a different 
reaction condition with each of a group of said 
unique solid supports, said group comprising at least 

20 one of said unique solid supports, and a combination 

of identifiers wherein said combination of 
identifiers defines the choice of agent and the stage 
in said series as to each group of said unique solid 
supports, each of said identifiers being individually 

25 bound to said unique solid support directly or 

through other than a prior identifier; 
mixing said groups together and then dividing said 
plurality of unique solid supports into a plurality 
of groups for a second intermediate or final stage; 

30 and 

repeating said reacting at least once to provide a 
plurality of final products, having different 
products on the different individual unique solid 
supports . 
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A method according to claim 1, wherein at least 100 
iinique solid supports and at least 2 groups are 
employed in each said reacting, 

A method according to claim 1, including the 
additional stages of screening said final products on 
said unique solid supports for a characteristic of 
interest; and identifying the reaction history of at 
least one final product having said characteristic of 
interest . 

A method of claim 1, further comprising cleaving the 
final product from the solid support and screening 
said final product. 

A method of claim 1, further comprising treating the 
identifiers so as to detach tag components from the 
solid supports and reacting said tag components with 
a moiety capable of detection by fluorescence or 
electron capture. 

A method of claim 5, wherein the detaching is done 
photochemical ly or oxidatively and the detectable 
moiety is derived from dansyl chloride or a 
polyhalobenzoylhalide . 

A method of claim 1, wherein the identifier comprises 
tag components which have two characteristics, a 
characteristic capable of separation and a 
characteristic capable of detection. 

A method according to claim 1, wherein said 
characteristic capable of detection is the ability to 
be detected by electron capture. 
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9. A method according to claim 1, wherein said 
characteristic capable of detection is the ability to 
be detected by mass spectroscopy. 

5 10. A method of claim 9, wherein the mass spectropscopy 
detects the presence of deuterium atoms. 

11. A method according to claim 1, wherein said 
characteristic capable of detection is radioactivity. 

10 

12. A method according to claim 7, wherein said 
characteristic capable of detection is fluorescence. 

13. A method according to claim 7, wherein said tags may 
15 be separated by means of chromatography. 

14. A kit comprising a plurality of different separated 
organic compounds, each of the compounds 
characterized by having a distinguishable 

20 composition, encoding at least one bit of different 

information which Ccoi be determined by a physical 
measurement and sharing at least one common 
fvmctionality, 

25 15. A kit of claim 14, comprising at least 4 different 
functional organic compounds, 



30 



16. A kit according to claim 14, wherein said functional 
organic compounds are of the formula: 



where F^-F^ is a linker which allows for attachment to 
and detachment from a solid particle; and 
35 C-E-C' is a tag which can be determined by a physical 

measurement . 
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17. A kit according to claim 16, wherein said functional 
organic compounds differ by the number of methylene 
groups and/or halogens, nitrogens or sulfurs present. 

5 18. A kit according to claim 16, wherein the C-E-C' 
^ portion can be removed photochemically . 

19. A kit according to claim 16, wherein the C-E-C' 
portion can be removed oxidatively, hydrolytically, 

10 thermolytically, or reductively. 

20. A solid support characterized by having a ligand 
bound thereto and having a combination of identifiers 
bound to said solid support. 

15 

21. A solid support according to claim 20, wherein said 
ligand is an oligomer which is an oligopeptide, 
oligonucleotide , oligosaccharide , polylipid, 
polyester, polyamide, polyurethane, polyurea, 

20 polyether, poly (phosphorus derivative) which is a 

phosphate , phosphona t e , phosphor amide , 
phosphonamidey, phosphite, or phosphinamide , poly 
(sulfur derivative) which is a sulfone, sulfonate, 
sulfite, sulfonamide, or sulfenamide, where for the 

25 phosphorous and sulfur derivatives the indicated 

heteroatom for the most part will be bonded to C, H, 
N, O or S, and combinations thereof. 

22. A solid support according to claim 20, wherein said 
30 ligand is a non-oligomer which is heterocyclic, 

aromatic, alicyclic, or aliphatic, and combinations 
thereof . 



35 



23. 



A solid support of claim 22, wherein the non-oligomer 
is a diazabicyclic, an azatricyclic, or a branched 
amide compound. 
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24. A solid support of claim 20, wherein the ligand is 
linked to the support through a non- labile linkage. 

25. A solid support of claim 20, wherein the ligand is 
5 linked to the support through a cleavable linkage. 

26. A solid support according to claim 20, wherein the 
identifier comprises tags, the tags being 
radioisotopes, or haloalkyl or haloarylallyl 

10 containing compounds. 

27. A solid support of claim 20, which is a bead of about 
10-2000 fim in diameter, and wherein the identifiers 
comprise tag components which after cleavage from the 

15 bead can be separated by gas chromatography and or 

liquid chromatography detected by electron capture, 
mass spectroscopy, fluorescence, or atomic emission 
techniques . 

20 28. A library comprising a plurality of solid supports 
according to claim 23. 

29. A library of claim 28, wherein the final products 
have been cleaved from the solid support. 

25 

30. A library of claim 29, wherein the final products are 
a diazabicyclic, azatricyclic, or branched amide 
compounds. 

3 0 31. A process for identifying compounds having a 
characteristic of interest which comprises screening 
a library of claim 28. 

32. A process of claim 31, wherein the compounds have 
35 been cleaved from the solid surface. 
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A process of claim 32, wherein the compound is a 
diazabicyclic, azatricyclic, or branched amide 
compound • 

A method for producing a ligand involving a reaction 
series employing a method for recording the reaction 
history of a reaction series on each o£ a plurality 
of unique solid supports, wherein said reaction 
series involves at least two stages requiring 
differing agents and/:;r reaction conditions resulting 
in a different modification as to a plurality of said 
unique solid supports, resulting in a plurality of 
different final products on different unique solid 
supports, employing a combination of identifiers for 
recording said reaction history, said identifiers 
characterized by defining the choice of agent or 
reaction condition and the stage in said series and 
being capable of being analyzed as to the choice and 
stage, said method comprising: 

reacting, at a first or intermediate stage of said 
series, a different agent or employing a different 
reaction condition with each of a group of said 
unique solid supports, said group comprising at least 
one of said unique rolid supports, and a combination 
of identifiers wherein said combination of 
identifiers defines the choice of agent and the stage 
in said series as to each group of said unique solid 
supports, each of said identifiers being individually 
boxind to said imique solid support directly or 
through other than a prior identifier; 
mixing said groups together and then dividing said 
plurality of unique solid supports into a plurality 
of groups for a second intermediate or final stage; 
repeating said reacting at least once to provide a 
plurality of ligands, having different products on 
the different individual unique solid surfaces; and 
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identifying said reaction history of at least one 
selected unique solid surface by means of said 
combination of identifiers. 

A ligand according to claim 34, wherein said 
identifying includes the stage of screening said 
ligands for a characteristic of interest. 

A method for producing a ligand involving a reaction 
series employing a method for recording the reaction 
history of a reaction series on each of a plurality 
of unique solid surfaces, wherein said reaction 
series involves at least two stages requiring 
differing agents and/or reaction conditions resulting 
in a different modification as to each of a plurality 
of said unique solid surfaces, resulting in a 
plurality of different ligands on different tinique 
solid surfaces, employing combinations of identifiers 
for recording said reaction history, said combination 
of identifiers characterized by defining the choice 
of agent and/or reaction condition and the stage in 
said series and being capable of being analyzed as to 
the choice and stage, said method comprising: 
reacting, at a first or intermediate stage of said 
series, a different agent and/or employing a 
different reaction condition with each of a group of 
said unique solid surfaces, said group comprising at 
least one of said unique solid surfaces, and a 
combination of identifiers wherein said combination 
of identifiers defines the choice of agent and the 
stage in said series as to each group of said unique 
solid surfaces, each of said identifiers being 
individually bound to said unique solid surface 
through other than a prior identifier by a cleavable 
link; 
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mixing said groups together and then dividing said 
plurality of iinique solid surfaces into a plurality 
of groups for a second intermediate or final stage; 
repeating said reacting to provide a plurality of 
ligands having different ligands on the different 
individual unique solid surfaces; 

screening the ligands from a plurality of each of 
said unique solid surfaces for a characteristic of 
interest; and 

identifying said reaction history of at least one 
selected unique solid surface having ligand having 
said characteristic of interest by detaching the tag 
members from said unique solid surface and 
identifying said tag members by means of a differing 
characteristic . 

A method according to claim 36, wherein said tags 
differ in an homologous series and are detected by 
electron capture gas chromatography or mass 
spectroscopy - 

A compoxind of the Formula I: 

where F^-F^ is a linker which allows for attachment to 

and detachment from a support; and 

C-E-C' is the tag which is capable of analysis; 

E is a tag component which allows for detection, or 

allows for detection eind provides for separation as 

a result of variable substitution; 

C and £' are tag components which allow for 
individual detection; 

F^ is a linking component capable of being selectively 
cleaved to release the tag components; and 
F^ is a functional group which allows ready attachment 
of the compound to a synthesis support. 
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39. A compound of claim 38, having the formula: 

F^-F^- (C{E-C') 

wherein: 

is CO2H, CHjX, NR^R^ C{0)RS OH, CHNj, SH, CiOCHNj, 
5 S(02)C1, S(02)CHNj, Nj, NO2, NO, S(02)N3, OC(0)X, » 

C(0)X, NCO, or NCS; 
p2 is 
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— S — C(R )2 A— . — C(X)R^^ C(r')2 a— , 



— C(OH)R^^ C(R^2A — . — C(OH)R — C(CH2X)R— , 

— C(OH)R— C(R^2 — C(X)R^ — , — CCOHXCHgCHgX) — . 




1 

R 
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with the proviso that when is a bond, is OH or 
COOH; 

A is -O, -0C(0)0-, -0C(0)-, or -NHC(O)-; 
C is a bond, C1-C20 alkylene optionally substituted by 
5 1-40 

F, CI, Br, Ci-Cg alkoxy, NR^RS 0R\ or NRS or 

- [ (C(R^)2)n,-Y-Z-Y- (C(R^)2)„Y-Z-Y]p-; with the proviso 
that the maximum number of carbon atoms in C+C' is 
20; 

10 CI is H; F; Cl; Ci-Cjo alkylene optionally substituted 

by 

1-40 F, CI, Br, Ci-Cg alkoxy, NR*R\ OR*, or NR*, or ' 

- [(C(R*)2)ni"Y-Z-Y-(C(R*)2)nY-Z-Y]p-; with the proviso 
that the maximum number of carbon atoms in C+C' is 

15 20; 

E is Ci-Cio alkyl substituted by 1-20 F, Cl or Br; or 
Q-aryl 

wherein the aryl is substituted by 1-7 F, Cl, NO2, 
SOaR^, or stibstituted phenyl wherein the substituent 
20 is 1-5 F, Cl, NO2, or SOaRS' 

E-C' may be -H, -OH, or amino; 
R^ is H or C^-Cg alkyl; 

R^ is C=0, C(O)0, C{0)NRS S, SO, or SO2; 
R* is H or Ci-Cfi alkyl; 
25 rs is Ci-Cg alkyl; 

a is 1-5; 
b is 1-3; 

m and n is each 0-20; 
p is 1-7; 

30 Q is a bond, O, S, NR^ C=0, -C(0)NR^ -NR5C(0)-, - 

C(0)0-, 
or -0C(0)-; 

X is a leaving group such as Br, Cl, triflate, 
mesylate, 
35 tosylate, or 0C(0)0RS- 

Y is a bond, O, S, or NR*; 
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Z is a bond; phenyl optionally substituted by 1-4 F, 
CI, Br, Ci-Cg alkyl, C^-Cs alkoxy, Ci-Cg alkyl 
substituted by 1-13 F, Cl, or Ci-Cg alkyloxy 
substituted by 1-13 F, Cl, or Br; (C(R*)2)i-2o; or 
5 (CF2)i.2o; with the proviso that when Z is a bond one 

of its adjacent Y's is also a bond and aryl is a 
mono- or bi -cyclic aromatic ring containing up to 10 
carbon atoms and up to 2 heteroatoms selected from O, 
S, and N. 

10 

40, A compound of claim 39, wherein: 
F^ is 



COgH. OH, CHN2> C(0)CHN2. C(0)X. NCS. or CH2X; 




C and C\ is each independently C1-C20 alkylene 
35 unsubstituted or substituted by 1-40 F or Cl, or [O- 



(CH2)2.3]p; 
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E is Ci-Cio alkyl substituted by 1-20 F or CI; Q-aryl 
where aryl is a bi-cyclic aromatic ring substituted 
by 1-7 F or CI; or Q-phenyl substituted by 1-5 F, Cl, 
NO2, or SOaR^; and 
5 Q is a bond, O, -m^CiO)-, or -0C(0)-. 



41. A compound of claim 39, having the formula: 

NO2 
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OMe 



5 




10 



0— (CHgl.s — 0 Ar 



and 
SO2CHIT2 

20 



25 




wherein Ar is pentafluoro- pentachloro- , or 
30 pentabromophenyl , 2,3,5, 6-tetraf luoro-4 (2,3,4,5,6- 

pentaf luorophenyl ) phenyl , 2,4, 6 - trichlorophenyl , 
2,4, 5 -trichlorophenyl, 2, 6-dichloro-4-f luorophenyl, 
or 2,3,5,6 -tetraf luorophenyl . 




35 



42. A compound of claim 39, wherein: 
E-C' is H, OH, or NHj. 
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43 . A composition of the formula 

S-F^' -F^-C-E-C' 

wherein: 

S is a soluble or solid support; 

C-E-C'is the tag which is capable of analysis where 
E is a tag component which (a) allows for detection, 
such as an electrophoric group which can be analyzed 
by gas chromatography or mass spectroscopy or (b) 
allows for detection and for separation as a result 
of variable substitution; 

C and C' are tag components which allow for 
distinguishing one tag from all other tags, usually 
allowing for separation as a result of variable 
length or siibstitution, for example, varying the 
chromatographic retention time or the mass 
spectroscopy ratio 2/e; 

F^ is a linking component capable of being selectively 
cleaved to release the tag; and 

F^' is a functional group which provides for 
attachment to the support. 

44. A composition of claim 43, wherein: 

S is a capillary, hollow fiber, needle, solid fiber, 
cellulose bead, pore-glass bead, silica gel, 
polystyrene bead optionally cross -linked with 
divinylbenzene, grafted co-poly bead, poly-acrylamide 
bead, latex bead, dimethylacrylamide bead optionally 
cross-linked with N,N' -bis-acryloyl ethylene diamine, 
glass particles coated with a hydrophobic polymer, or 
low molecular weight non-cross -linked polystyrene; 
and 

F^' -F^-C-E-C' is the residue of Formula I attached to 
S. 

45. The method of claim 1, wherein the combination of 
identifiers defines a binary coding scheme. 
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46. The method of claim 1, wherein the identifiers are of 
claim 38. 

47. The method of claim 1, wherein the identifiers are of 
claim 39. 

48. The method of claim 1, wherein the identifiers are of 
claim 40. 

49. The method of claim 1, wherein the identifiers are of 
claim 42 . 

50. The method of claim 1, further comprising detaching 
the tag members from said unique solid surfaces. 

51. The method of claim 50, wherein the tag members are 
detached photochemical ly, oxidatively, 
hydrolytically, thermolytically, or reductively. 

52. The method of claim 1, further comprising detaching 
non-oligomer ligands from said unique solid surfaces 
photochemically. 



95/28640 



PCT/US95/04683 



-156- 

53 . A compound of the formula 




0 R 



wherein: 

P is a polystyrene resin; 

IXa_f is a plurality of residues of the formula 



CI 




0 CI 



wherein : 
n is 1 - 6; 

R is CH3, CH(CH3)2, CH2CO2H, (CH2)4NH2, CH2-C6H4-OH, or 
CH2C6H5; and 
is H, CH3, C2H5, CH2CH=CH2, or CHjCfiHg. 

54 . A method of synthesizing a chemical compound so that 
the structure of the compound is readily 
determinable, which comprises synthesizing the 
compound on the surface of a solid support under 
conditions such that the solid support at the 
completion of the synthesis of the compound has bound 
to it a plurality of identifiers which encode the 
reaction stages associated with the synthesis of the 
compound. 
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55- A method of synthesizing a library of chemical 
compounds so that the structure of each compound in 
the library is readily determinable which comprises 
synthesizing each compound on the surface of a unique 
solid support under conditions such that each such 
unique support at the completion of the synthesis of 
the library of compounds has bound to it a plurality 
of identifiers which encode the reaction stages 
associated with the synthesis of the compound 
synthesized on such solid support, 

56. A method of determining the structure of a chemical 
compound which comprises synthesizing the compound by 
the method of claim 54 or 55, isolating the solid 
support upon which the compound was synthesized, 
treating the, solid support so isolated so as to cause 
the tag components of each of the identifiers bound 
to the solid support to be released, determining the 
identity or quantity or both of each tag component so 
released, and deriving the structure of the compoxind 
from the identities or quantities or both of all such 
tag components. 

57. A method of identifying a compound having a desired 
characteristic which comprises synthesizing a library 
of chemical comp^ounds by the method of claim 55, 
separately testing each of the compounds in the 
resulting library in an assay which identifies 
compounds having the desired characteristic so as to 
identify any compounds present in the library which 
has the desired characteristic. 

58. A method of claim 57, further comprising determining 
the structure of the compound so identified. 

( 

59. A library of chemical compounds, each compound in the 
library being bound to a unique solid support and 



wo 95/28640 



PCT/US95/04683 



-158- 

each such solid support having bound to it a 
plurality of identifiers which encode the reaction 
stages associated with the synthesis of the compound 
bound to such solid support. 

60. A library of claim 59, wherein compounds in the 
library are diazabicyclic compounds. 

61. A library of claim 59, wherein compounds in the 
library are azatricyclic compounds. 

62. A library of claim 59, wherein compounds in the 
library are branched amide compounds. 

63. A library of claim 59, wherein compound in the 
library are peptides. 

64 . A method of identifying a compound having a desired 
characteristic which comprises testing a library of 
chemical compounds according to claim 58 in an assay 
which identifies compounds having the desired 
characteristic so as to identify any compound present 
in the library which have the desired characteristic . 

65. A method of claim 64, further comprising determining 
the structure of the compound so identified. 

66. A compound identified by the method of claim 64. 

67. A method of claim 64, wherein the desired 
characteristic is antagonism for the human neurokinin 
1/brandykin receptor and the library of chemical 
compounds comprises azatricyclic compounds. 

68. A method of claim 64, wherein the desired 
characteristic in usefulness as a muscle relaxant, a 
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tranquilizer or a sedative and the library of 
chemical compounds comprises bezodiazepines . 

69. A method of claim 64, wherein the desired 
5 characteristic is useful in the treatment of 

hypertension or Raynaud's syndrome and the library of 
chemical compotinds comprises branched amides, 

70, A composition comprising at least 6 different 
10 components, said comprrents being associated in a kit 

or in a common medium, each component having a 
distinguishable moiety, said components being 
characterized by: 



15 each distinguishable moiety being substantially 

chemically inert, differing in molecular weight, and 
having and identifiable characteristic different from 
each of the other distinguishable moieties other than 
light emission; 

20 

each moiety joined to a linking group having an 
active functionality capable of forming a covalent 
bond, through a linking unit to individually 
separable solid surfaces, or joined to a group which 
25 is detectable at less than 1 nanomole. 



with the proviso that when said moieties are joined 
to said linking group, said components are physically 
segregated. 

30 

71. A composition according to claim 70, wherein said 
solid supports are beads. 

72. A composition according to claim 70, wherein each 
35 component comprises molecules of different compounds 

bound to individual separable solid surfaces, wherein 
said distinguishable moieties encode the synthetic 



/ 
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protocol used to synthesize said molecules on said 
solid surfaces. 

73. A composition according to claim 70, wherein said 
5 distinguishable moieties define an homologous series 

and/or a series of substituents on a core molecule. 

74. A composition comprising at least 6 different markers 
said markers being associated in a kit or in a common 

10 medium, each marker having a distinguishable moiety 

which is substantially chemically inert, differing 
from each other in molecular weight, said markers of 
the formula: 

M5 (1) A-{A-(T)« or {T)«-A or A^-CT)^-^^} 

wherein 

A is a linking group comprising a functionality for 
20 bonding to a solid support and a functionality for 

detachment from the solid support, which may be 
included in the functionality for bonding to. the 
solid support; 

25 A is a distinguishing group, which allows for the 

distinguishing by a physical characteristic of each 
of the markers from each of the other markers by 
other than fluorescence, so as to provide a set of 
markers which allows for coding a multistep synthetic 

30 procedure, and includes any remaining functional 

group after detachment, which was previously 
associated with the linked group; 



35 



and A^ are portions of the distinguishing group, 
which together define a distinguishing group and when 
joined together come within the definition of A; 
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T is a detectable group, which when attached to the 
distinguishing group allows for the low level 
detection of the marker, where the detectable group 
may be present on the marker in the kit or may be 
5 added later to the distinguishing group and if 

attached to the linking group, includes any remaining 
functional group after detachment, which was 
previously associated with the linking group; and 

10 a is 0 or 1, indicating that the detectable group may 

or may not be present; 

(2) ss-(A'-{A-(T)„ or (T)„-A or A^- (T) ^-A^) ) ^ 

15 wherein: 

all of the symbols have been defined previously, 
except as follows: 

20 ss is a solid support; 

A' is a linking group covalently bonded to ss; and 

/S is an integer as to each solid support and is at 
25 least six and usually not more than about 30; 

(3) A"-{A"-{T)„ or (T")a-A or A' (T) 
wherein: 

30 

all of the symbols have been defined previously, 
except as follows : 

A" is hydrogen or the residue of the linking group 
35 after photolytic cleavage, elimination or other 

chemical reaction which results in detachment from 
the solid support; 
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A" or A'^ is A or A^ or a modified A or A^, 
respectively, as a result of the detachment of 
the marker from the solid support; 

5 T" is T or a modified T as a result of the detachment 

of the marker from the solid support; 

(4) T«-A' "-{A"- (T)„ or (T")^-A or A'^-{T)«-a2) 
wherein: 

10 

all of the symbols have been defined previously and 

A'" is a bond or the remaining portion of the linker 
group after attachment to T; 

with the additional limitation that only one Of is 1, 

75. A composition according to claim 74, wherein the 
markers are of the formula (1) . 

76. A composition according to claim 74, wherein the 
markers are of the formula (2) . 

77 . A composition according to claim 74 , wherein the 
25 markers are of the formula (3) . 

78. A con^osition according to claim 74, wherein the 
markers are of the formula (4) . 

30 79. A composition according to claim 74, wherein A is 
bound to A. 



15 



20 



35 



80. 



A compound library comprising at least one hundred 
unique solid supports characterized by: 
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each solid support having (1) an individual compound 
bound to said solid support as the major compound 
bound to said support; and 

5 (2) a plurality of nonsequencable markers, where 

said markers are individual molecules which are 
physically distinguishable in being physically 
separable, differing in molecular weight and being 
distinguishable by other than fluorescence and are 
10 substituted so as to be detectable at less than about 

a nanomole or have a fiinctional group for bonding to 
a substituent which is detectable at less than about 
a nanomole. 

15 81. A compound library according to claim 80, wherein 
each solid support has at least about 6 markers . 

82. A compound library according to claim 80, wherein 
said markers define a binary or higher code encoding 

20 the synthetic protocol used for synthesizing said 

compound on said solid support. 

83. The compound of claim 38, wherein is CHN2, 
C(0)CHN2, 8(03) CHN2, N3, NO2, or SO2N3. 



25 



84. The compound of claim 38, wherein is COOH, CHNj, 
C(0)CHN2/ S(02)CHN2, COCl, OH, SH, CH2X, or NHRS- 
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wherein is 

.NO 




wherein A is -O- or -OC(0)0-; 

5 £(E-C' )3 is -(CRS)i.:,s-(0)o.i-Ar, - (CR^) o-is" (CF2) 1-15-F, 
' -(CR%)o-i5-{CF2)i.x5-(CR%)o.i5-H, or 
- (CH2) ,.20- ( (O) 0.1- {CH2) 1.,,) 0-24- (CH2) 0-24- (O) 0-1-Ar , 

wherein Ar is pentaf luoro- , pentachloro- , or 
10 pentabromophenyl, 2,3,5,6- tetraf luoro-4 (2,3,4,5,6- 

pentaf luorophenyl ) phenyl , 2,4, 6 - trichlorophenyl , 
2,4, 5- trichlorophenyl , 2,6 -dichloro-4 - f luorophenyl , 
or 2, 3, 5, 6 -tetraf luorophenyl. 
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